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About This Manual 


Preface 


Read This First 


This user’s guide serves as an applications reference book for the TMS320C3x 
generation of digital signal processors (DSPs). These include the TMS320C30, 
TMS320C31, TMS320LC31, TMS320C32, and TMS320VC33. Throughout the 
book, all references to 'C3x refer collectively to the C30, 'C31/VC33, ’LC31 and 
C32. 


This book provides information to assist managers and hardware/software 
engineers in application development. It includes example code and hard- 
ware connections for various applications. 


The guide shows how to use the instructions set, the architecture, and the ’C3x 
interface. It presents examples for frequently used applications and discusses 
more involved examples and applications. It also defines the principles involved 
in many applications and gives the corresponding assembly language code for 
instructional purposes and for immediate use. Whenever the detailed explanation 
of the underlying theory is too extensive to be included in this manual, appropriate 
references are given for further information. 


Notational Conventions 


This document uses the following conventions. 


J Program listings, program examples, and interactive displays are shown in 
aspecial typeface. Examples use a bold version of the special 
typeface for emphasis; interactive displays use a bold version of the 
special typeface to distinguish commands that you enter from items that the 
system displays (Such as prompts, command output, error messages, etc.). 


Here is a sample program listing: 


0011 0005 0001 .field Ly, 2 
0012 0005 0003 .field 3, 4 
0013 0005 0006 .field 6, 3 


0014 0006 .even 


Notational Conventions 


Here is an example of a system prompt and a command that you might 
enter: 


C: esr -a /user/ti/simuboard/utilities 


In syntax descriptions, the instruction, command, or directive is in bold 
typeface and parameters are in an italic typeface. Portions of a syntax that 
are in bold must be entered as shown; portions of a syntax that are in italics 
describe the type of information that must be entered. Here is an example 
of a directive syntax: 


-asect section name”, address 


The directive .asect has two parameters, indicated by section name and 
address. When you use .asect, the first parameter is an actual section 
name, enclosed in double quotes; the second parameter is an address. 


Square brackets ( [ and ] ) identify an optional parameter. If you use an 
optional parameter, you specify the information within the brackets; you 
do not enter the brackets themselves. Here is an example of an instruction 
that has an optional parameter: 


LALK 16-bit constant [, shift] 


The LALK instruction has two parameters. The first parameter, 16-bit 
constant, is required. The second parameter, shift, is optional. As this syntax 
shows, if you use the optional second parameter, you must precede it with a 
comma. 


Square brackets are also used as part of the pathname specification for 
VMS pathnames; in this case, the brackets are actually part of the path- 
name (they are not optional). 


Braces ( {and} ) indicate a list. The symbol | (read as or) separates items 
within the list. Here is an example of a list: 


{* | ** | } 
This provides three choices: *, *+, or *-. 


Unless the list is enclosed in square brackets, you must choose one item 
from the list. 


Some directives can have a varying number of parameters. For example, 
the .byte directive can have up to 100 parameters. The syntax for this direc- 
tive is: 


-byte value; [, ..., value] 


This syntax shows that .byte has at least one value parameter, but you 
may supply additional value parameters, separated by commas. 


Information About Cautions / Related Documentation from Texas Instruments 


Information About Cautions 


This book contains cautions. 


This is an example of a caution statement. 


A caution statement describes a situation that could potentially 
damage your software or equipment. 


The information in a caution is provided for your protection. Please read each 
caution carefully. 


Related Documentation From Texas Instruments 


The following books describe the TMS320 floating-point devices and related 
support tools. To obtain a copy of any of these Tl documents, call the Texas 
Instruments Literature Response Center as indicated in the section /f You 
Need Assistance... on page vi. When ordering, please identify the book by its 
title and literature number. 


TMS320C3x General Purpose Applications User’s Guide (literature number 
SPRU194) provides information to assist you in application development 
for the TMS320C3x generation of digital signal processors (DSPs). It 
includes example code and hardware connections for various appliances. 
It also defines the principles involved in many applications and gives the 
corresponding assembly language code for instructional purposes and for 
immediate use. 


TMS320C3x/C4x Assembly Language Tools User’s Guide (literature 
number SPRU0O35) describes the assembly language tools (assembler, 
linker, and other tools used to develop assembly language code), 
assembler directives, macros, common object file format, and symbolic 
debugging directives for the ’C3x and ’C4x generations of devices. 


TMS320C3x/C4x Optimizing C Compiler User’s Guide (literature number 
SPRU034) describes the TMS320 floating-point C compiler. This C compiler 
accepts ANSI standard C source code and produces TMS320 assembly 
language source code for the ’C3x and ’C4x generations of devices. 


Read This First V 


Related Documentation from Texas Instruments / References 


References 


vi 


TMS320C3x C Source Debugger User’s Guide (literature number 
SPRUO053) tells you how to invoke the ’C3x emulator, evaluation module, 
and simulator versions of the C source debugger interface. This book 
discusses various aspects of the debugger interface, including window 
management, command eniry, code execution, data management, and 
breakpoints. It also includes a tutorial that introduces basic debugger 
functionality. 


TMS320 DSP Development Support Reference Guide (literature number 
SPRUO011) describes the TMS320 family of digital signal processors and 
the tools that support these devices. Included are code-generation tools 
(compilers, assemblers, linkers, etc.) and system integration and debug 
tools (simulators, emulators, evaluation modules, etc.). Also covered are 
available documentation, seminars, the university program, and factory 
repair and exchange. 


TMS320 Third-Party Support Reference Guide (literature number 
SPRU052) alphabetically lists over 100 third parties that provide various 
products that serve the family of TMS320 digital signal processors. A 
myriad of products and applications are offered—software and hardware 
development tools, speech recognition, image processing, noise can- 
cellation, modems, etc. 


The publications in the following reference list contain useful information 
regarding functions, operations, and applications of digital signal processing 
(DSP). These books also provide other references to many useful technical 
papers. The reference list is organized into categories of general DSP, speech, 
image processing, and digital control theory and is alphabetized by author. 


Lj General Digital Signal Processing 


Antoniou, Andreas, Digital Filters: Analysis and Design. New York, NY: 
McGraw-Hill Company, Inc., 1979. 


Bateman, A., and Yates, W., Digital Signal Processing Design. Salt Lake 
City, Utah: W. H. Freeman and Company, 1990. 


Brigham, E. Oran, The Fast Fourier Transform. Englewood Cliffs, NJ: 
Prentice-Hall, Inc., 1974. 


Burrus, C.S., and Parks, T.W., DFT/FFT and Convolution Algorithms. New 
York, NY: John Wiley and Sons, Inc., 1984. 


Chassaing, R., and Horning, D., Digital Signal Processing with the 
TMS320C25. New York, NY: John Wiley and Sons, Inc., 1990. 


Digital Signal Processing Applications with the TMS320 Family, Vol. |. Tex- 
as Instruments, 1986; Prentice-Hall, Inc., 1987. 
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Jackson, Leland B., Digital Filters and Signal Processing. Hingham, MA: 
Kluwer Academic Publishers, 1986. 


Jones, D.L., and Parks, T.W., A Digital Signal Processing Laboratory Using 
the TMS32010. Englewood Cliffs, NJ: Prentice-Hall, Inc., 1987. 
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Chapter 1 


Introduction 


The TMS320C8x generation of digital signal processors (DSPs) are high- 
performance CMOS 32-bit floating-point devices in the TMS320 family of 
single-chip DSPs. 


The ’C3x generation integrates both system control and math-intensive functions 
on a single controller. This system integration allows fast, easy data movement 
and high-speed numeric processing performance. Extensive internal busing and 
a powerful DSP instruction set provide the devices with the speed and flexibility 
to execute at up to 60 million floating-point operations per second (MFLOPS). 
The devices also feature a high degree of on-chip parallelism that allows users 
to perform up to 11 operations in a single instruction. 
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TMS320C3x Devices 


1.1 TMS320C3x Devices 


The ’C3x family consists of three members: the ’C30, ’C31, ’C32, and ’VC33. 
The ’C30, ’C31, C32, and ’VC33 can perform parallel multiply and arithmetic 
logic unit (ALU) operations on integer or floating-point data in a single cycle. 


The processors also possess the following features for high performance and 
ease of use: 


Te 


Note: VC33 


The TMS320VC33 is logically and functionally a superset of the TMS320C31 
making nearly all literature and documentation for the TMS320C31 applica— 
ble. However, the electrical characteristics are different and are covered in 
detail in the TMS320VC33 data sheet (literature number SPRS087). 
a | 
General-purpose register file 
Program cache 
Dedicated auxiliary register arithmetic units (ARAU) 


Internal dual-access memories 


tw oo 


One direct memory access (DMA) channel (a two-channel DMA on the 
TMS320C32) supporting concurrent I/O 


4) Short machine-cycle time 


General-purpose applications are greatly enhanced by the large address 
space, multiprocessor interface, internally and externally generated wait states, 
two external interface ports (one on the C31 and the ’C32) two timers, two serial 
ports (one on the ’C31, ’VC33, and the ’C32), and multiple-interrupt structure. 
The ’C3x supports a wide variety of system applications from host processor to 
dedicated coprocessor. 


High-level language is implemented more easily through a register-based archi- 
tecture, large address space, powerful addressing modes, flexible instruction 
set, and well-supported floating-point arithmetic. 


Figure 1-1 shows a block diagram of ’C3x devices. 


Figure 1-1. TMS320C3x Devices Block Diagram 
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TMS320C3x Key Specifications 


The key specifications of the ’C3x devices include the following: 


TMS320C30 


OHOUOUCU 


Performance up to 60 MFLOPS 
Highly efficient C language engine 

Large address space: 16M words x 32 bits 
Fast memory management with on-chip DMA 
Industry-exclusive 3-V versions available on some devices 


Program access 
32-bit (’C30-’C31) 
16/32-bit (’C32) 


A23-0 

STRBO_B3-0 (C32) 

STRB1_B3-0 (C32) 
IOSTRB (C32) 
PRGW ('C32) 


[inert Y=) ro 


SerialportO0 {_» 


K Serial port 1 (C30) K _ » 


The ’C30 is the first member of the ’C3x generation. It differs from the 
°C31/VC33 and ’C32 by offering 4K ROM, 2K RAM, a second serial port, and 


a second external bus. 


TMS320C31 and TMS320LC31 


The ’C31 and ’LC31 are the second members of the ’C3x generation. They are 
low-cost 32-bit floating-point DSPs which have a boot-loader program, 2K RAM, 
single external port, single serial port, and are available in 3.3-V operation 
(LC31). 
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1.1.4 TMS320C32 


1.1.5 TMS320VC33 


The ’C32 is the newest member of the ’'C3x generation. They are enhanced 
versions of the ’C3x family and the lowest cost floating-point processors on the 
market today. These enhancements include a variable-width memory inter- 
face, two-channel DMA coprocessor with configurable priorities, flexible boot 
loader, and a relocatable interrupt vector table. 


The .VC33 is the fourth member of the .C3x generation. This is a low-cost 
32-bit floating-point DSPs which has a boot-loader program, 34K RAM, single 
external port, single serial port, and is fabricated in an ultra-low power 
(3.3-V/1.8-V) CMOS process. 
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Typical Applications 


1.2 Typical Applications 


The TMS320 family’s versatility, realtime performance, and multiple functions 
offer flexible design approaches in a variety of applications, which are shown 
in Table 1-2. 


Table 1-2. Typical Applications of the TMS320 Family 


General-Purpose DSP 
Digital filtering 


Graphics/Imaging 


Instrumentation 


3-D transformations rendering 


Convolution 

Correlation 

Hilbert transforms 

Fast Fourier transforms 
Adaptive filtering 
Windowing 

Waveform generation 


Robot vision 

Image transmission/compression 
Pattern recognition 

Image enhancement 
Homomorphic processing 
Workstations 

Animation/digital map 


Spectrum analysis 
Function generation 
Pattern matching 
Seismic processing 
Transient analysis 
Digital filtering 
Phase-locked loops 


Bar-code scanners 


Voice/Speech 
Voice mail 


Control 
Disk control 


Military 


Secure communications 


Speech vocoding 
Speech recognition 
Speaker verification 
Speech enhancement 
Speech synthesis 
Text-to-speech 
Neural networks 


Servo control 

Robot control 

Laser printer control 
Engine control 
Motor control 
Kalman filtering 


Radar processing 

Sonar processing 

Image processing 
Navigation 

Missile guidance 

Radio frequency modems 
Sensor fusion 


Telecommunications 


Echo cancellation 
ADPCM transcoders 
Digital PBXs 

Line repeaters 

Channel multiplexing 
Modems 

Adaptive equalizers 
DTMF encoding/decoding 
Data encryption 


FAX 

Cellular telephones 
Speaker phones 
Digital speech 
Interpolation (DSI) 
X.25 packet switching 
Video conferencing 
Spread spectrum 
Communications 


Automotive 


Engine control 
Vibration analysis 
Antiskid brakes 
Anticollision 
Adaptive ride control 
Global positioning 
Navigation 

Voice commands 
Digital radio 


Cellular telephones 


Consumer 


Industrial 


Robotics 


Medical 


Radar detectors 


Power tools 

Digital audio/TV 
Music synthesizer 
Toys and games 
Solid-state answering 
Machines 


Numeric control 
Security access 
Power line monitors 
Visual inspection 
Lathe control 

CAM 


Hearing aids 

Patient monitoring 
Ultrasound equipment 
Diagnostic tools 
Prosthetics 

Fetal monitors 


MR imaging 
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Architectural Overview 


This chapter provides an architectural overview of the ‘C3x processor. It includes 
a discussion of the CPU, memory interface, boot loader, peripherals, and direct 
memory access (DMA) of the ’C3x processor. 
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2.1 Overview 


2-2 


The ’C3x architecture responds to system demands that are based on sophisti- 
cated arithmetic algorithms that emphasize both hardware and software solu- 
tions. High performance is achieved through the precision and wide dynamic 
range of the floating-point units, large on-chip memory, a high degree of parallel- 
ism, and the DMA controller. 


eS | 


Note: VC33 


The TMS320VC33 is logically and functionally a superset of the TMS320C31 
making nearly all literature and documentation for the TMS320C31 applica— 
ble. However, the electrical characteristics are different and are covered in 
detail in the TMS320VC33 data sheet (literature number SPRS087). 


ss | 


Figure 2-1 through Figure 2-3 show functional block diagrams of the ’C30, 
’°C31/VC33, and ’C32 architectures, respectively. 
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Figure 2-1. TMS320C30 Block Diagram 


RAM 
Cache block 1 
(64 X 32) (1K x 32) 
PDATA bus XRDY 
MSTRB 
PADDR bus lIOSTRB 
—RDY —> 4 v v XR/W 
HOLD —y | 5 | DDATA bus 3 XD31-XD¢ 
HOLDA 47, | # 6 XA12-XAC 
STRB ¢X_)| &|_DADDA1 bus 2 
RW, 18 La 2 
D31-Do «| | = |_DADDR2 bus 
A23-A0 42 
DMADATA bus 
DMAADDR bus 
32 v Y 24 4 Serial port 0 
Port-control 
DXO 
R/X timer CLKXO 
|__| register FSRO 
5 Data-transmit DRO 
se CLKRO 
Data-receive 
RESET —> register 
INT3—0 —> Destination- 
IACK¢—| address _ Serial port 1 
MC/MP —>} register 
XF(1,0 i 2 Port-control FSX1 
(1,0) Transfer- S a register Dxt 
Vpp(s-0) =| Iau counter a 2 - 
lODVpp (1,0) —» iB cx) |G] 32 register g 3 R/Xtimer CLKX1 
ADVpp(1,0) =) I lb Qo} |3 register FSR1 
PDVppD S) O} 19 fs i Data-transmit DR1 
DDVpp(1,0) 7 te} loc shifter s| |5 register CLKR1 
MDV, i] = ra 
Pa 3 ALU © 5 Data-receive 
Vss(3-0) 2 a register 
DVgs(3-0) g 
CVgs(1,0) ia Extended- 40 
Vss precision < 
VBBP registers Global-control 
SUBS (R7-RO) register 
x1 Timer-period TCLKO 
X2/CLKIN register 
H1 : 
a 
EMU6-0 register 
RSV10-0 
SHZ ¢ 
Global-control 
i register 
Auxiliary 
registers Timer-period TCLK1 
(ARO-AR7) register 
tl 
register 
PDATA bus — program data bus Other 
PADDR bus - program address bus | ie ar Port control 
DDATA bus - data data bus ¢ 
DADDR1 bus - data address 1 bus 
Expansion 
DADDR2 bus - data address 2 bus | exansion | 
NON 


Architectural Overview 2-3 


Overview 


Figure 2-2. TMS320C31 Block Diagram 
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Figure 2-3. TMS320C32 Block Diagram 
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Central Processing Unit (CPU) 


2.2 Central Processing Unit (CPU) 
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The ’C8x devices (’C30, ’C31/VC33, and ’C32) have a register-based CPU ar- 
chitecture. The CPU consists of the following components: 


Floating-point/integer multiplier 

Arithmetic logic unit (ALU) 

32-bit barrel shifter 

Internal buses (CPU1/CPU2 and REG1/REG2) 
Auxiliary register arithmetic units (ARAUs) 
CPU register file 


DOO OU 


Figure 2-4 shows a diagram of the various CPU components. 


Central Processing Unit (CPU) 


Figure 2-4. Central Processing Unit (CPU) 
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Tt Disp = an 8-bit integer displacement carried in a program-control instruction 
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Central Processing Unit (CPU) 


2.2.1 


Floating-Point/Integer Multiplier 


The multiplier performs single-cycle multiplications on 24-bit integer and 32-bit 
floating-point values. The ‘C3x implementation of floating-point arithmetic allows 
for floating-point or fixed-point operations at speeds up to 33-ns per instruction 
cycle. To gain even higher throughput, you can use parallel instructions to perform 
a multiply and an ALU operation in a single cycle. 


When the multiplier performs floating-point multiplication, the inputs are 32-bit 
floating-point numbers, and the result is a 40-bit floating-point number. When 
the multiplier performs integer multiplication, the input data is 24 bits and yields 
a 32-bit result. See Chapter 5, Data Formats and Floating-Point Operation, for 
detailed information. 


2.2.2 Arithmetic Logic Unit (ALU) and Internal Buses 


The ALU performs single-cycle operations on 32-bit integer, 32-bit logical, 
and 40-bit floating-point data, including single-cycle integer and floating- 
point conversions. Results of the ALU are always maintained in 32-bit integer 
or 40-bit floating-point formats. The barrel shifter is used to shift up to 32 bits 
left or right in a single cycle. See Chapter 5, Data Formats and Floating-Point 
Operation, for detailed information. 


Four internal buses, CPU1, CPU2, REG1, and REG2 carry two operands from 
memory and two operands from the register file, allowing parallel multiplies 
and adds/subtracts on four integer or floating-point operands in a single cycle. 


2.2.3 Auxiliary Register Arithmetic Units (ARAUs) 
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Two auxiliary register arithmetic units (ARAUO and ARAU1) can generate two 
addresses in a single cycle. The ARAUs operate in parallel with the multiplier 
and ALU. They support addressing with displacements, index registers (IRO and 
IR1), and circular and bit-reversed addressing. See Chapter 6, Addressing 
Modes, for more information. 


CPU Primary Register File 


2.3 CPU Primary Register File 


The ’C3x provides 28 registers in a multiport register file that is tightly coupled 
to the CPU. Table 2-1 lists the register names and functions. 


All of the primary registers can be operated upon by the multiplier and ALU and 
can be used as general-purpose registers. The registers also have some special 
functions. For example, the eight extended-precision registers are especially 
suited for maintaining extended-precision floating-point results. The eight auxiliary 
registers support a variety of indirect addressing modes and can be used as 
general-purpose 32-bit integer and logical registers. The remaining registers 
provide such system functions as addressing, stack management, processor 
status, interrupts, and block repeat. See Chapter 3, CPU Registers, for more 


information. 
Table 2-1. Primary CPU Registers 

Register 

Name Assigned Function Section Page 
RO Extended-precision register 0 3.1.1 3-3 
R1 Extended-precision register 1 3.1.1 3-3 
R2 Extended-precision register 2 3.1.1 3-3 
R3 Extended-precision register 3 3.1.1 3-3 
R4 Extended-precision register 4 3.1.1 3-3 
R5 Extended-precision register 5 3.1.1 3-3 
R6 Extended-precision register 6 3.1.1 3-3 
R7 Extended-precision register 7 3.1.1 3-3 
ARO Auxiliary register 0 3.1.2 3-4 
AR1 Auxiliary register 1 3.1.2 3-4 
AR2 Auxiliary register 2 3.1.2 3-4 
AR3 Auxiliary register 3 3.1.2 3-4 
AR4 Auxiliary register 4 3.1.2 3-4 
AR5 Auxiliary register 5 3.1.2 3-4 
AR6 Auxiliary register 6 3.1.2 3-4 
AR7 Auxiliary register 7 3.1.2 3-4 
DP Data-page pointer 3.1.3 3-4 
IRO Index register 0 3.1.4 3-4 
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CPU Primary Register File 


Table 2-1. Primary CPU Registers (Continued) 


Register 

Name Assigned Function Section Page 
IR1 Index register 1 3.1.4 3-4 
BK Block-size register 3.1.5 3-4 
SP System-stack pointer 3.1.6 3-4 
ST Status register S47 3-5 
IE CPU/DMA interrupt-enable regis- 3.1.8 3-9 

ter 

IF CPU interrupt flag 3.1.9 3-11 
IOF I/O flag 3.1.10 3-16 
RS Repeat start-address 3.1.11 3-17 
RE Repeat end-address 3.1.11 3-17 
RC Repeat counter 3.1.11 3-17 


The extended-precision registers (R7—RO) can store and support operations 
on 32-bit integers and 40-bit floating-point numbers. Any instruction that assumes 
the operands are floating-point numbers uses bits 39-0. If the operands are 
either signed or unsigned integers, only bits 31-0 are used; bits 39-32 remain 
unchanged. This is true for all shift operations. See Chapter 5, Data Formats and 
Floating-Point Operation, for extended-precision register formats for floating- 
point and integer numbers. 


The 32-bit auxiliary registers (AR7-ARO) are accessed by the CPU and 
modified by the two ARAUs. The primary function of the auxiliary registers is 
the generation of 24-bit addresses. They also can be used as loop counters 
or as 32-bit general-purpose registers that are modified by the multiplier and 
ALU. See Chapter 6, Addressing Modes, for detailed information and examples 
of the use of auxiliary registers in addressing. 


The data-page pointer (DP) is a 32-bit register. The eight least significant bits 
(LSBs) of the data-page pointer are used by the direct addressing mode as a 
pointer to the page of data being addressed. Data pages are 64K words long, 
with a total of 256 pages. 


The 32-bit index registers (IRO, IR1) contain the value used by the ARAU to 
compute an indexed address. See Chapter 6, Addressing Modes, for examples 
of the use of index registers in addressing. 


CPU Primary Register File 


The ARAU uses the 32-bit block size register (BK) in circular addressing to 
specify the data block size. 


The system-stack pointer (SP) is a 32-bit register that contains the address 
of the top of the system stack. The SP always points to the last element pushed 
onto the stack. A push performs a preincrement; a pop performs a postdecre- 
ment of the system-stack pointer. The SP is manipulated by interrupts, traps, 
calls, returns, and the PUSH and POP instructions. See Section 6.10, System 
and User Stack Management, on page 6-29, for more information. 


The status register (ST) contains global information relating to the state of the 
CPU. Operations usually set the condition flags of the status register according 
to whether the result is 0, negative, etc. These include register load and store 
operations as well as arithmetic and logical functions. When the status register 
is loaded, however, a bit-for-bit replacement is performed with the contents of 
the source operand, regardless of the state of any bits in the source operand. 
Following a load, the contents of the status register are identical to the contents 
of the source operand. This allows the status register to be easily saved and 
restored. See Table 3-2 on page 3-6 for a list and definitions of the status regis- 
ter bits. 


The CPU/DMA interrupt-enable register (IE) is a 32-bit register. The CPU 
interrupt-enable bits are in locations 10-0. The DMA interrupt-enable bits are 
in locations 26-16. A 1 in a CPU/DMA interrupt-enable register bit enables the 
corresponding interrupt. A 0 disables the corresponding interrupt. See Sec- 
tion 3.1.8 on page 3-9 for more information. 


The CPU interrupt flag register (IF) is also a 32-bit register. A 1 ina CPU 
interrupt flag register bit indicates that the corresponding interrupt is set. A 
0 indicates that the corresponding interrupt is not set. See Section 3.1.9 on 
page 3-11 for more information. 


The I/O flag register (IOF) controls the function of the dedicated external pins, 
XFO and XF1. These pins may be configured for input or output and may also 
be read from and written to. See Section 3.1.10 on page 3-16 for more informa- 
tion. 


The repeat-counter (RC) is a 32-bit register that specifies the number of times 
to repeat a block of code when performing a block repeat. When the processor 
is operating in the repeat mode, the 32-bit repeat start-address register (RS) 
contains the starting address of the block of program memory to repeat, and 
the 32-bit repeat end-address register (RE) contains the ending address of the 
block to repeat. 
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2.4 Other Registers 


The program-counter (PC) is a 32-bit register containing the address of the 
next instruction to fetch. Although the PC is not part of the CPU register file, 
itis a register that can be modified by instructions that modify the program flow. 


The instruction register (IR) is a 32-bit register that holds the instruction opcode 
during the decode phase of the instruction. This register is used by the instruction 
decode control circuitry and is not accessible to the CPU. 


Memory Organization 


2.5 Memory Organization 


The total memory space of the ’C3x is 16M (million) 32-bit words. Program, 
data, and I/O space are contained within this 16M-word address space, allowing 
the storage of tables, coefficients, program code, or data in either RAM or 
ROM. In this way, memory usage is maximized and memory space allocated 
as desired. 


2.5.1 RAM, ROM, and Cache 


Figure 2-5 shows how the memory is organized on the ’C30. RAM blocks 0 
and 1 are each 1K x 32 bits. The ROM block, available only on the ’C30, is 
4K x 32 bits. Each RAM and ROM block is capable of supporting two CPU 
accesses in a single cycle. 


Figure 2-6 shows how the memory is organized on the ’C31/VC33. RAM 
blocks 0 and 1 are each 1K x 32 bits and support two accesses in a single 
cycle. The VC33 has two additional 16K RAM blocks bringing the total on—chip 
memory to 34K words (see the TMVS320VC33 Data Sheet, literature number 
SPRS087). A boot loader allows the loading of program and data at reset from 
8-, 16-, 32-bit-wide memories or serial port. 


Figure 2-7 shows how the memory is organized on the ‘C32. RAM blocks 0 
and 1 are each 256 x 32 bits and support two accesses in a single cycle. A 
boot loader allows the loading of program and data at reset from 1-, 2-, 4-, 8-, 
16-, and 32-bit-wide memories or serial port. The ‘C32 enhanced external 
memory interface provides the flexibility to address 8-, 16-, or 32-bit data indepen- 
dently of the external memory width. The external memory width can be 8-, 16-, 
or 32-bits wide. 


The ’C3x’s separate program, data, and DMA buses allow for parallel program 
fetches, data reads and writes, and DMA operations. For example, the CPU can 
access two data values in one RAM block and perform an external program 
fetch in parallel with the DMA controller loading another RAM block, all within 
a single cycle. 
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Figure 2-5. Memory Organization of the TMS320C30 
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Memory Organization 


Figure 2-6. Memory Organization of the TMS320C31/VC33 
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Memory Organization 


Figure 2-7. Memory Organization of the TMS320C32 
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A 64 xX 32-bit instruction cache is provided to store often-repeated sections 
of code, which greatly reduces the number of off-chip accesses. This allows 
for code to be stored off chip in slower, lower-cost memories. The external 
buses are also freed for use by the DMA, external memory fetches, or other 


devices in the system. 


See Chapter 4, Memory and the Instruction Cache, for more information. 
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Memory Organization 


2.5.2 Memory Addressing Modes 


The ’C3x supports a base set of general-purpose instructions as well as arithmetic- 
intensive instructions that are particularly suited for digital signal processing and 
other numeric-intensive applications. See Chapter 6, Addressing Modes, for more 
information. 


Four groups of addressing modes are provided on the ’C3x. Each group uses 
two or more of several different addressing types. The following list shows the 
addressing modes with their addressing types. 


LJ General instruction addressing modes: 
m@ Regisiter. The operand is a CPU register. 


m Short immediate. The operand is a 16-bit (short) or 24-bit (long) imme- 
diate value. 


m Direct. The operand is the contents of a 24-bit address formed by 
concatenating the 8 bits of data-page pointer and a 16-bit operand. 


@ Indirect. An auxiliary register indicates the address of the operand. 


[J 3-operand instruction addressing modes: 
m Regisiter. Same as for general addressing mode. 


m Indirect. Same as for general addressing mode. 


.) Parallel instruction addressing modes: 
m Register. The operand is an extended-precision register. 
m Indirect. Same as for general addressing mode. 

(1 Branch instruction addressing modes: 


m Regisiter. Same as for general addressing mode. 


m PC-relative. A signed 16-bit displacement or a 24-bit displacement is 
added to the PC. 
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2.6 Internal Bus Operation 


Much of the ’C3x’s high performance is due to internal busing and parallelism. 
Separate buses allow for parallel program fetches, data accesses, and DMA 
accesses: 


_j Program buses: PADDR and PDATA 
_} Data buses: DADDR1, DADDR2, and DDATA 
_} DMA buses: DMAADDR and DMADATA 


These buses connect all of the physical spaces (on-chip memory, off-chip 
memory, and on-chip peripherals) supported by the ’C3x. Figure 2-5, 
Figure 2-6, and Figure 2-7 show these internal buses and their connections 
to on-chip and off-chip memory blocks. 


The program counter (PC) is connected to the 24-bit program address bus 
(PADDR). The instruction register (IR) is connected to the 32-bit program data 
bus (PDATA). These buses can fetch a single instruction word every machine 
cycle. 


The 24-bit data address buses (DADDR1 and DADDR2) and the 32-bit data 
data bus (DDATA) support two data-memory accesses every machine cycle. 
The DDATA bus carries data to the CPU over the CPU1 and CPU2 buses. The 
CPU1 and CPU2 buses can carry two data-memory operands to the multiplier, 
ALU, and register file every machine cycle. Also internal to the CPU are register 
buses REG1 and REG2, which can carry two data values from the register file 
to the multiplier and ALU every machine cycle. Figure 2—4 shows the buses 
internal to the CPU section of the processor. 


The DMA controller is supported with a 24-bit address bus (DMAADDR) and 
a 32-bit data bus (DMADATA). These buses allow the DMA to perform memory 
accesses in parallel with the memory accesses occurring from the data and 
program buses. 


External Memory Interface 


2.7 External Memory Interface 


The ’C30 provides two external interfaces: the primary bus and the expansion 
bus. The ’C31/VC33 provides one external interface: the primary bus. The ’C32 
provides one enhanced external interface with three independent multi-function 
strobes. These buses consist of a 32-bit data bus and a set of control signals. The 
primary and enhanced memory buses have a 24-bit address bus, whereas the 
expansion bus has a 13-bit address bus. These buses address external program/ 
data memory or I/O space. The buses also have external RDY signals for wait- 
state generation. You can insert additional wait states under software control. 
Chapter 9, External Memory Interface, covers external bus operation. 


The ’C3x family was designed for 32-bit instructions and 32-bit data operations. 
This architecture has many advantages, including a high degree of parallelism 
and provisions for a C compiler. However, the C30 and ’C31/VC33 require a 
32-bit-wide external memory even when the data requires only 8- or 16-bit-wide 
memories. The ’C32 enhanced external memory interface overcomes this limi- 
tation by providing the flexibility to address 8-, 16-, or 32-bit data independently 
of the external memory width. In this way, the chip count and the size of external 
memory is reduced. The number of memory chips can be further reduced by 
the 'C32’s ability to allow code execution from 16- or 32-bit-wide memories. The 
’C32 memory interface also reduces the total amount of RAM by allowing the 
physical data memory to be 8, 16, or 32 bits wide. Internally, the ‘C32 has a 
32-bit architecture. So you can treat the ’C32 as a 32-bit device regardless of 
the physical external memory width. The external memory interface handles the 
conversion between external memory width and ’C32 internal 32-bit architec- 
ture. 


2.7.1_ TMS320C32 16- and 32-Bit Program Memory 


The ’C32 executes code from either 16- or 32-bit-wide memories. When 
connected to 32-bit memories, ‘C32 program execution is identical to that 
of the ’C31/VC33. When connected to 16-bit zero wait-state memory, the 
’C32 takes two instruction cycles to fetch a single 32-bit instruction. During 
the first cycle, the C32 fetches the lower 16 bits. During the second cycle, 
the ‘C32 fetches the upper 16 bits and concatenates them with the previous- 
ly fetched lower 16 bits. This process occurs entirely within the memory in- 
terface and is transparent to you. An external pin, PRGW, dictates the exter- 
nal program memory width. 
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2.7.2 TMS320C32 8-, 16-, and 32-Bit Data Memory sow! 


The ’C32 external memory interface can load and store 8-, 16-, or 32-bit quanti- 
ties into external memory and convert them into an internally-equivalent 32-bit 
representation. The external memory interface accomplishes this without 
changing the CPU instruction set. Figure 2-8 shows the supported external 
memory widths, data types and sizes for zero wait-state memory and the asso- 
ciated cycle count. 


Figure 2-8. TMS320C32-Supported Data Types and Sizes and External Memory Widths 


2-20 


Memory Width 
8 16 32 


Data 8 | 1-cycle read 1-cycle read | 1-cycle read 
Type 16 2-cycle read 1-cycle read 1-cycle read 
Size 32 4-cycle read 2-cycle read 1-cycle read 


To access 8-, 16-, or 32-bit data quantities (types) from 8-, 16-, or 32-bit-wide 
memory, the memory interface uses either strobe STRBO or STRB1, depending 
on the address location within the memory map. Each strobe consists of four pins 
for byte enables and/or additional addresses. For a 32-bit memory interface, all 
four pins are used as strobe byte-enable pins. These strobe byte-enable pins 
select one or more bytes of the external memory. For a 16-bit memory interface, 
the ‘C32 uses one of these pins as an additional address pin, while using two 
pins as strobe byte-enable pins. For an 8-bit memory interface, the ‘C32 uses 
two of these pins as additional address pins while using one pin as strobe pin. 
The ’C32 manipulates the behavior of these pins according to the contents of 
the bus control registers (one control register per strobe). By setting a few bit 
fields in this register, you indicate the data-type size and external memory width. 


2.8 


Interrupts 


Interrupts 


The ’C3x supports four external interrupts (INT3-INTO), a number of internal 
interrupts, and a nonmaskable external RESET signal. These can be used to 
interrupt either the DMA or the CPU. When the CPU responds to the interrupt, 
the IACK pin can be used to signal an external interrupt acknowledge. Section 
7.5, Reset Operation, on page 7-21 covers RESET and interrupt processing. 


The ’C30 and ’C31 external interrupts are level-triggered. To reduce external 
logic and simplify the interface, the ‘C32 external interrupts are edge- and level- 
or level-only triggered. The triggering is user-selectable through a bit in the 
status register. The VC33 external interrupts are either edge or level—trig— 
gered depending upon the state of the external EDGEMODE pin. See Sec- 
tion 3.1.7, Status Register (ST), for more information. 


Two external I/O flags, XFO and XF1, can be configured as input or output pins 
under software control. These pins are also used by the interlocked operations 
of the ’C3x. The interlocked-operations instruction group supports multiproces- 
sor communication. See Section 7.4, interlocked Operations, on page 7-13 for 
examples. 
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2.9 Peripherals 


All ’C3x peripherals are controlled through memory-mapped registers on a dedi- 
cated peripheral bus. This peripheral bus is composed of a 32-bit data bus and 
a 24-bit address bus. This peripheral bus permits straightforward communica- 
tion to the peripherals. The ’C3x peripherals include two timers and two serial 
ports (only one serial port and one DMA coprocessor are available on the 
°C31/VC33 and one serial port and two DMA coprocessor channels on the 
C32). Figure 2-9 shows these peripherals with their associated buses and sig- 
nals. See Chapter 12, Peripherals, for more information. 


Figure 2-9. Peripheral Modules 
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2.9.1. Timers 


2.9.2 Serial Ports 


Peripherals 


The two timer modules are general-purpose 32-bit timer/event counters with 
two signaling modes and internal or external clocking. They can signal internally 
to the ’C3x or externally to the outside world at specified intervals or they can 
count external events. Each timer has an I/O pin that can be used as an input 
clock to the timer, as an output signal driven by the timer, or as a general-purpose 
I/O pin. See Chapter 12, Peripherals, for more information about timers. 


The bidirectional serial ports (two on ’C30, one each on the ’C31/VC33 and ’C32) 
are totally independent. They are identical to a complementary set of control reg- 
isters that control each port. Each serial port can be configured to transfer 8, 16, 
24, or 32 bits of data per word. The clock for each serial port can originate either 
internally or externally. An internally generated divide-down clock is provided. 
The pins are configurable as general-purpose I/O pins. The serial ports can also 
be configured as timers. A special handshake mode allows ’C3x devices to 
communicate over their serial ports with guaranteed synchronization. 
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2.10 Direct Memory Access (DMA) 
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The on-chip DMA controller can read from or write to any location in the 
memory map without interfering with the CPU operation. The ’C3x can inter- 
face to slow, external memories and peripherals without reducing throughput 
to the CPU. The DMA controller contains its own address generators, source 
and destination registers, and transfer counter. Dedicated DMA address and 
data buses minimize conflicts between the CPU and the DMA controller. A 
DMA operation consists of a block or single-word transfer to or from memory. 
See Section 12.3, DMA Controller, on page 12-48 for more information. 
Figure 2-10 shows the DMA controller and its associated buses. 


The ’C30 and ’C31/VC33 DMA coprocessors have one channel, while the ’C32 
DMA coprocessor has two channels. Each channel of the C32 DMA coproces- 
sor is equivalent to the C30/31 DMA with the addition of user-configurable prior- 
ities. Because the DMA and CPU have distinct buses on the ’C3x devices, they 
can operate independently of each other. However, when the CPU and DMA 
access the same on-chip or external resources, the bandwidth can be exceeded 
and priorities must be established. The ‘C30 and ’C31/VC33 assign highest 
priority to the CPU. The ’C32 DMA coprocessor provides more flexibility by al- 
lowing you to choose one of the following priorities: 


.) CPU: For all resource conflicts, the CPU has priority over the DMA. 


_j DMA: For all resource conflicts, the DMA has priority over the CPU. 


Lj Rotating: When the CPU and DMA have a resource conflict during con- 
secutive instruction cycles, the CPU is granted priority. On the following 
cycle, the DMA is granted priority. Alternate access continues as long as 
the CPU and DMA requests conflict in consecutive instruction cycles. 


The DMA/CPU priority is configured by the DMA PRI bit fields of the corresponding 
DMA global-control register. See Section 12.3, DMA Controller, on page 12-48 for 
a complete description. 


Figure 2-10. DMA Controller 
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TMS320C30, TMS320031/VC33, and TMS320C32 Differences 


2.11 TMS320C30, TMS320C31/VC33, and TMS320C32 Differences 


Table 2-2 shows the major differences between the ’C32, ’C31/VC33, and the 
C30 devices. 
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Table 2-2. Feature Set Comparison 


Feature 


External bus 


ROM 
Boot loader 
On-chip RAM 


DMA 


Serial ports 
Timers 


Interrupts 


Interrupt vector 
table 


Package 


Voltage 


Temperature 


°C30 
Two buses: 


Lj Primary bus: 
32-bit data 
24-bit address 
STRB active for 
Oh-7FFFFFh and 
80A000h—-FFFFFFh 


Lj Expansion bus: 
32-bit data 
13-bit address 
MSTRB active for 
800000h-801FFFh 
IOSTRB active for 
804000h-805FFFh 


4k 
No 


2k 
address: 
809800h-809FFFh 


1 channel 
CPU greater priority than 
DMA 


2 
2 


Level-triggered 


Fixed 0-3Fh 


208 PQFP 
181 PGA 


5V 


0° to 85°C (commercial) 
—40 to 125°C (extended) 
—55 125°C (military) 


°C31/VC33 
One bus: 


32-bit data 

24-bit address 

STRB active Oh-7FFFFFh 
and 80A000h-FFFFFFh 


No 
Yes 


2k 
address: 
809800h-809FFFh 


1 channel 
CPU greater priority than 
DMA 


1 
2 


Level-triggered 


Microprocessor: 0-3Fh 
fixed 

Boot loader: 
809C1h-809FFFh fixed 


132 PQFP 


5 Vand 3.3 V 


0° to 85°C (commercial) 
—40 to 125°C (extended) 
—55 125°C (military) 


TMS320C30, TMS320031/VC33, and TMS320C32 Differences 


’°C32 
One bus: 


L] 32-bit data 
24-bit address 
STRBO active for 
Oh-7FFFFFh and 
880000h-—8FFFFFh; 


LI 8-, 16-, 32-bit data in 
8-, 16-, 32-bit-wide 
memory 
STRB1 active for 
900000h—FFFFFFh; 


LJ 8-, 16-, 32-bit data in 
8-, 16-, 32- bit-wide 
memory 
IOSTRB active for 
810000h-82FFFFh 


No 
Yes 


512 
address: 
87FE00h-87FFFFh 


2 channels 
Configurable priorities 


1 
2 


Level-triggered or com- 
bination of edge- and 
level-triggered 


Relocatable 


144 PQFP 


5V 


0° to 85°C (commercial) 
—40 to 125°C (extended) 
—55 125°C (military) 
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Chapter 3 


CPU Registers 


The central processing unit (CPU) register file contains 28 registers that can 
be operated on by the multiplier and arithmetic logic unit (ALU). Included in the 
register file are the auxiliary registers, extended-precision registers, and index 
registers. 


Three registers in the C32 CPU register file have been modified to support new 
features (2-channel DMAs, program execution from 16-bit memory width, etc.) 
The registers modified in the ‘C32 are: the status (ST) register, interrupt-enable 
(IE) register, and interrupt flag (IF) register. 
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CPU Multiport Register File 


3.1 CPU Multiport Register File 


The ’C8x provides 28 registers in a multiport register file that is tightly coupled to 
the CPU. The program counter (PC) is not included in the 28 registers. All of these 
registers can be operated on by the multiplier and the ALU and can be used as 
general-purpose 32-bit registers. 


Table 3-1 lists the registers’ names and assigned functions of the ’C3x. 


Table 3-1. CPU Registers 


Register 
Register Machine 
Symbol Value (hex) Assigned Function Name Section Page 
RO 00 Extended-precision register 0 3.1.1 3-3 
R1 01 Extended-precision register 1 3.1.1 3-3 
R2 02 Extended-precision register 2 3.1.1 3-3 
R3 03 Extended-precision register 3 3.1.1 3-3 
R4 04 Extended-precision register 4 3.1.1 3-3 
R5 05 Extended-precision register 5 3.1.1 3-3 
R6 06 Extended-precision register 6 3.1.1 3-3 
R7 07 Extended-precision register 7 3A 3-3 
ARO 08 Auxiliary register 0 331.2 3-4 
AR1 09 Auxiliary register 1 3.122 3-4 
AR2 OA Auxiliary register 2 3.1.2 3-4 
AR3 0B Auxiliary register 3 3.1.2 3-4 
AR4 0C Auxiliary register 4 3.1.2 3-4 
AR5 0D Auxiliary register 5 3.1.2 3-4 
AR6 OE Auxiliary register 6 3.1.2 3-4 
AR7 OF Auxiliary register 7 31.2 3-4 
DP 10 Data-page pointer 3.1.3 3-4 
IRO 11 Index register 0 3.1.4 3-4 
IR1 12 Index register 1 3.1.4 3-4 
BK 13 Block-size register 3.1.5 3-4 
SP 14 System-stack pointer 3.1.6 3-4 
ST 15 Status register 3.1.7 3-5 
IE 16 CPU/DMA interrupt-enable 3.1.8 3-9 
IF 17 CPU interrupt flags 3.1.9 3-11 
IOF 18 I/O flags 3.1.10 3-16 
RS 19 Repeat start-address 3.1.11 3-17 
RE 1A Repeat end-address 3.1.11 3-17 
RC 1B Repeat counter 3.1.11 3-17 
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The registers also have some special functions for which they are particularly 
appropriate. For example, the eight extended-precision registers are especially 
suited for maintaining extended-precision floating-point results. The eight auxiliary 
registers support a variety of indirect addressing modes and can be used as 
general-purpose 32-bit integer and logical registers. The remaining registers 
provide system functions, such as addressing, stack management, processor 
status, interrupts, and block repeat. See Chapter 6, Addressing Modes, for more 
information. 


3.1.1. Extended-Precision Registers (R7—R0) 


The eight extended-precision registers (R7—RO) can store and support operations 
on 32-bit integer and 40-bit floating-point numbers. These registers consist of two 
separate and distinct regions: 


Lj Bits 39-32: dedicated to storage of the exponent (e) of the floating-point 
number. 


_j Bits 31-0: store the mantissa of the floating-point number: 


Bit 31: sign bit (s) 
Bits 30-0: the fraction (f) 


Any instruction that assumes the operands are floating-point numbers uses 
bits 39-0. Figure 3-1 illustrates the storage of 40-bit floating-point numbers 
in the extended-precision registers. 


Figure 3-1. Extended-Precision Register Floating-Point Format 
39 32 31 30 


a Mantissa _ 


For integer operations, bits 31-0 of the extended-precision registers contain 
the integer (signed or unsigned). Any instruction that assumes the operands 
are either signed or unsigned integers uses only bits 31-0. Bits 39-32 remain 
unchanged. This is true for all shift operations. The storage of 32-bit integers 
in the extended-precision registers is shown in Figure 3-2. 


oO 


Figure 3-2. Extended-Precision Register Integer Format 
39 a2. 31 


Unchanged Signed or unsigned integer 
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3.1.2 Auxiliary Registers (AR7—ARO) 


The CPU can access the eight 32-bit auxiliary registers (AR7—-ARO), and the 
two auxiliary register arithmetic units (ARAUs) can modify them. The primary 
function of the auxiliary registers is the generation of 24-bit addresses. However, 
they can also operate as loop counters in indirect addressing or as 32-bit general- 
purpose registers that can be modified by the multiplier and ALU. See Chap- 
ter 6, Addressing Modes, for more information. 


3.1.3. Data-Page Pointer (DP) 


3.1.4 


The data-page pointer (DP) is a 32-bit register that is loaded using the load data 
page (LDP) instruction (see Chapter 13, Assembly Language Instructions). The 
eight LSBs of the data-page pointer are used by the direct addressing mode as a 
pointer to the page of data being addressed (see Section 6.3, Direct Addressing, 
on page 6-4). Data pages are 64K-words long, with a total of 256 pages. Bits 31-8 
are reserved; you must always keep these set to 0 (cleared). 


Index Registers (IRO, IR1) 


The 32-bit index registers (IRO and IR1) are used by the ARAU for indexing 
the address. See Chapter 6, Addressing Modes, for more information. 


3.1.5 Block Size (BK) Register 


The 32-bit block size register (BK) is used by the ARAU in circular addressing to 
specify the data block size. See Section 6.7, Circular Addressing, on page 6-21 
for more information. 


3.1.6 System-Stack Pointer (SP) 
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The system-stack pointer (SP) is a 32-bit register that contains the address of the 
top of the system stack. The SP always points to the last element pushed onto 
the stack. The SP is manipulated by interrupts, traps, calls, returns, and the 
PUSH, PUSHF, POP, and POPF instructions. Stack pushes and pops perform 
preincrements and postdecrements on all 32 bits of the SP. However, only the 
24 LSBs are used as an address. See Section 6.10, System and User Stack 
Management, on page 6-29 for more information. 
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3.1.7 Status (ST) Register 


The status (ST) register contains global information about the state of the CPU. 
Operations usually set the condition flags of the status register according to 
whether the result is 0, negative, etc. This includes register load and store 
operations as well as arithmetic and logical functions. However, when the 
status register is loaded, the contents of the source operand replace the ST’s 
contents bit for bit, regardless of the state of any bits in the source operand. 
Following an ST load, the contents of the status register are identical to the 
contents of the source operand. This allows the status register to be saved 
easily and restored. At system reset, a 0 is written to this register. 


Figure 3-3 shows the format of the status register for the C30 and ’C31/VC33 
devices. Figure 3-4 shows the format of the status register for the ‘C32 device. 
Table 3-2 describes the status register bits, their names, and their functions. 


Figure 3-3. Status Register (TMS320C30 andTMS320C31/VC33) 


31-16, 15 14 13.12 4 11 10 9 8. 7 6 5 4 
ty ty 


' RW RW,RW RW RW. RW RW RW RW, RW RW RW RW 


Notes: 1) xx =reserved bit, read as 0 
2) R=read, W =write 


Figure 3-4. Status Register (TMS320C32 Only) cowl 


31-16! 15 (4.78 4a Ph to EG Gh 


1 
xx aPRGW] INT | Ge | cot cE | ce | xx | RM+ovm| LUE] LV | uF+ N | z/] vio 
1 status | config ' ' ' 


\ R RW RW RW) RW ORM Rw; RW RW RW ORMW YY RW ORW RW sR 


Notes: 1) xx =reserved bit, read as 0 
2) R=read, W =write 
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Table 3-2. Status Register Bits 


Bit Name Reset Value Name 
Cc 0 Carry flag 
V 0 Overflow flag 
Z 0 Zero flag 
N 0 Negative flag 
UF 0 Floating-point under- 
flow flag 
LV 0 Latched overflow flag 
LUF 0 Latched floating-point 
underflow flag 
OVM 0 Overflow mode flag 
RM 0 Repeat mode flag 
CE 0 Cache enable 
Note: 
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Description 


Carry condition flag 
Overflow condition flag 
Zero condition flag 
Negative condition flag 


Floating-point underflow condition flag 


Latched overflow condition flag 


Latched floating-point underflow condition flag 


Overflow mode flag 
The overflow mode flag affects only integer operations. 


If OVM = 0, the overflow mode is turned off and integer 
results that overflow are treated in no special way. 


If OVM = 1, integer results overflowing in the positive 
direction are set to the most positive, 2s-complement 
number (7FFF FFFFh), and integer results overflowing 
in the negative direction are set to the most negative 
32-bit, 2s-complement number (8000 0000h). 


Repeat mode flag 


If RM = 1, the PC is modified in either the repeat-block 
or repeat-single mode. 


CE enables or disables the instruction cache. 


Set CE = 1 to enable the cache, allowing the cache to 
be used according to the least recently used (LRU) 
stack manipulation. 


Set CE = 0 to disable the cache, preventing cache 
updates or modifications (no cache fetches can be 
made). Cache clearing (CC = 1) is allowed when 
CE=0. 


If a load of the status register occurs simultaneously with a CPU interrupt pulse trying to reset GIE, GIE is reset. 


Table 3-2. Status Register Bits (Continued) 


Bit Name Reset Value Name 


CF 0 Cache freeze 

CC 0 Cache clear 

GIE 0 Global interrupt-enable 

INT config 0 Interrupt configuration 
(‘C32 only) 

Note: 
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Description 
Enables or disables the instruction cache 


Set CF = 1 to freeze the cache (cache is not updated), 
including LRU stack manipulation. If the cache is 
enabled (CE = 1), fetches from the cache are allowed, 
but modification of the cache contents is not allowed. 
Cache clearing (CC = 1) is allowed. At reset, this bit 
is cleared to 0, but it is set to 1 after reset. 


When CF = 0, the cache is automatically updated by 
instruction fetches from external memory. Also, when 
CF = 0, cache clearing (CC = 1) is allowed. 


The following table summarizes the CE and CF bits: 


CE CF Effect 
0 0 Cache not enabled 
0 1 Cache not enabled 
1 0 Cache enabled and not frozen 
1 1 Cache enabled but frozen 


(cache read only) 


CC = 1 invalidates all entries in the cache. This bit is 
always cleared after it is written to, and is always read 
as 0. At reset, 0 is written to this bit. 


If GIE = 1, the CPU responds to an enabled interrupt. 


If GIE = 0, the CPU does not respond to an enabled 
interrupt. 


Sets the external interrupt signals INT3 — INTO for level- 
or edge-triggered interrupts. 


INT Config Effect 


0 All the external interrupts (INT3 — INTO) 
are configured as_ level-triggered 
interrupts. Multiple interrupts may be 
triggered when the signal is active for 
a long period of time. 


| All the external interrupts (INT3 — INTO) 
are configured as edge-triggered inter- 
rupts. Edge and duration are required 
for all interrupts to be recognized. 


If a load of the status register occurs simultaneously with a CPU interrupt pulse trying to reset GIE, GIE is reset. 
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Table 3-2. Status Register Bits (Continued) 


Bit Name Reset Value Name Description 

PRGW Dependent Program width status Indicates the status of the external input PRGW pin. 
on PRGW (‘C32 only) When the signal of the PRGW pin is high, the PRGW 
pin level status bit is set to 1, indicating a 16-bit memory width. 


Note: 
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The ‘C32 performs two fetches to retrieve a single 32-bit 
instruction word. The PRGW bit is a read-only bit, and 
can have the following values: 


PRG Effect 


0 Instruction fetches use one 32-bit exter- 
nal program memory read. 


1 Instruction fetches use two 16-bit exter- 
nal program memory reads. 


If a load of the status register occurs simultaneously with a CPU interrupt pulse trying to reset GIE, GIE is reset. 
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3.1.8 CPU/DMA Interrupt-Enable (IE) Register 


The CPU/DMA interrupt-enable (IE) register of the C30, 'C31/VC33, and ’C32 
are 32-bit registers (see Figure 3-5 and Figure 3-6). The CPU interrupt-enable 
bits are in locations 10-0 for ‘C30 and ’C31/VC33 devices, and 11-0 for ’C32 
devices. The direct memory access (DMA) interrupt-enable bits are in locations 
26-16 for ‘C30 and ‘C31/VC33 devices, and 31-16 for C32 devices. A1 ina 
CPU/DMA IE bit enables the corresponding interrupt. A 0 disables the corre- 
sponding interrupt. At reset, 0 is written to this register. 


Table 3-3 describes the interrupt-enable register bits, their names, and their 


functions. 


Figure 3-5. CPU/DMA Interrupt-Enable (IE) Register 
(TMS320C30 and TMS320C031/VC33) 


31 30 29 28° 27 a (CS 23 20 ! 19 


Bor ltscdlboclipcensoa| EDINT ETINT1 ETINTO ERINT1 EXINT1 ERINTO EXINTO EINT3 EINT2 EINT1 EINTO 
(DMA) (DMA) (DMA) (DMA) (DMA) (DMA) (DMA) (DMA) (DMA) (DMA) (DMA) 


15 14 13 12°41 8 7 4 3 
EDINT ETINT1 ETINTO ERINT1 EXINT1 ERINTO EXINTO EINT3 EINT2 EINT1 EINTO 
XX | XX | XX Bes XX (CPU) (CPU) (CPU) (CPU) (CPU) (CPU) (CPU) (CPU) (CPU) (CPU) (CPU) 


Notes: 1) xx =reserved bit, read as 0 
2) R=read, W =write 


Figure 3-6. CPU/DMA Interrupt-Enable (IE) Register (TMS320C32) 


EINT3 | EINT2 | EINT1 | EINTO LEDINTO EDINT1 | ETINT4 SS eon ETINTO | ERINTO EXINTO 4 EINT3 | EINT2 | EINT1 | EINTO 
(DMA1)} (DMA1) | (DMA1)}(DMA1) » (DMA1) | (DMAO) | (DMAO) | (DMAO) # (DMA1)} (DMA1) | (DMA1) | (DMAO) , (DMAO)} (DMAO) | (DMAO) |(DMA0) 


R/W ; R/W R/W RW , RW R/W # R/W 


EDINT1 EDINTO ETINT1 ETINTO ERINTO EXINTO EINT3 EINT2 EINT1 EINTO 
(CPU) (CPU) (CPU) (CPU) , (CPU) (CPU) (CPU) (CPU) (CPU) (CPU) 


RW 46 R/W RIW 4 R/W 


Notes: 1) xx =reserved bit, read as 0 
2) R=read, W =write 
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Table 3-3. IE Bits and Functions 


Abbreviation 
EINTO (CPU 
EINT1 (CPU 
EINT2 (CPU 
EINT3 (CPU 
EXINTO (CPU) 
ERINTO (CPU) 
EXINT1 (CPU) 
ERINT1 (CPU) 
ETINTO (CPU 
ETINT1 (CPU 
EDINT (CPU) 


) 
) 
) 
) 


) 
) 


EDINTO (CPU) 
EDINT1 (CPU) 
EINTO (DMA) 


EINT1 (DMA) 
EINT2 (DMA) 
EINT3 (DMA) 


EINTO (DMAO 
EINT1 (DMAO 
EINT2 (DMAO 
EINT3 (DMAO 
EXINTO (DMA) 


) 
) 
) 
) 


ERINTO (DMA) 


EXINT1 (DMA) 


Reset 
Value 


0 


(on © © © ee 2 ee ee ee 2 eee 2 eee) 


oO 


on On oe ee) 


Description 

CPU external interrupt 0 enable 

CPU external interrupt 1 enable 

CPU external interrupt 2 enable 

CPU external interrupt 3 enable 

CPU serial port 0 transmit interrupt enable 

CPU serial port 0 receive interrupt enable 

CPU serial port 1 transmit interrupt enable (’C30 only) 
CPU serial port 1 receive interrupt enable (’C30 only) 
CPU timer0 interrupt enable 

CPU timer1 interrupt enable 


CPU DMA controller interrupt enable 
(‘C30 and ’'C31/VC33 only) 


CPU DMAO controller interrupt enable (’C32 only) 
CPU DMAY1 controller interrupt enable (’C32 only) 


DMA external interrupt 0 enable (’C30 and ’C31/VC33 
only) 


DMA external interrupt 1 enable (’C30 and ’C31/VC33 
only) 


DMA external interrupt 2 enable (’C30 and ’C31/VC33 
only) 


DMA external interrupt 3 enable (C30 and ’C31/VC33 
only) 

DMAO external interrupt 0 enable (’C32 only) 
DMAO external interrupt 1 enable (’C32 only) 
DMAO external interrupt 2 enable (’C32 only) 
DMAO external interrupt 3 enable (’C32 only) 
DMA serial port 0 transmit interrupt enable 
(‘C30 and 'C31/VC33 only) 


DMA serial port 0 receive interrupt enable 
(‘C30 and 'C31/VC33 only) 


DMA serial port 1 transmit interrupt enable (’C30 only) 
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Table 3-3. IE Bits and Functions(Continued) 


Reset 
Abbreviation Value Description 


ERINT1 (DMA) 0 DMA serial port 1 receive interrupt enable (’C30 only) 


EXINTO (DMAO) 0 DMAO serial port 1 transmit interrupt enable (’C32 only) 

ERINTO (DMA1) 0 DMA1 serial port 1 receive interrupt enable (’C32 only) 

ETINTO (DMA) 0 DMA timer0 interrupt enable (C30 and ’C31/VC33) 

ETINT1 (DMA) 0 DMA timer1 interrupt enable (‘C30 and ’C31/VC33 only) 

ETINTO (DMAO) 0 DMAO timer1 interrupt enable (’C32 only) 

ETINT1 (DMAO) 0 DMAO timer1 interrupt enable (’C32 only) 

ETINTO (DMA1) 0 DMA‘1 timerO interrupt enable (’C32 only) 

ETINT1 (DMA1) 0 DMA1 timer1 interrupt enable (’C32 only) 

EDINT (DMA) 0 DMA controller interrupt enable (’C30 and ’C31/VC33 
only) 

EDINT1 (DMAO) 0 DMAO-DMA1 controller interrupt enable (’C32 only) 

EDINTO (DMA1) 0 DMA1-DMA0 controller interrupt enable (’C32 only) 

EINTO (DMA1) 0 DMA1 external interrupt 0 enable (’C32 only) 

EINT1 (DMA1) 0 DMA1 external interrupt 1 enable (’C32 only) 

EINT2 (DMA1) 0 DMA1 external interrupt 2 enable (’C32 only) 

EINT3 (DMA1) 0 DMA1 external interrupt 2 enable (’C32 only) 


3.1.9 CPU Interrupt Flag (IF) Register 


Figure 3-7, Figure 3-8, and Figure 3-9 show the 32-bit CPU interrupt flag reg- 
isters (IF) for the ‘C30, ‘C31/VC33, and ‘C32 devices, respectively. A 1 ina CPU 
IF register bit indicates that the corresponding interrupt is set. The IF bits are 
set to 1 when an interrupt occurs. They may also be set to 1 through software 
to cause an interrupt. A 0 indicates that the corresponding interrupt is not set. 
If a 0 is written to an IF register bit, the corresponding interrupt is cleared. At re- 
set, 0 is written to this register. Table 3-4 describes the interrupt flag register 
bits, their names, and their functions. 
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Figure 3-7. TMS320C30 CPU Interrupt Flag (IF) Register 


31-16 115-12) 11 


‘ 


R/W RW! RW RW RW 


Notes: 1) xx =reserved bit, read as 0 
2) yy =reserved bit, set to 0 at reset; can store value 
3) R=read, W = write 


Figure 3-8. TMS320C31/VC33 CPU Interrupt Flag (IF) Register 


31-16 415-12) 11 10 
XX « yy t DINT | TINT1 | TINTO« xx mE RINTO| XINTO « INT3 | INT2 | INT1 | INTO 
: : R/W RW! RW RW. RW 


Notes: 1) xx =reserved bit, read as 0 
2) yy =reserved bit, set to 0 at reset 
3) R=read, W = write 


Figure 3-9. TMS320C32 CPU Interrupt Flag (IF) Register 


31-16 115-12, 11 


, ; R/W RW | R/IW R/W R/W 


Notes: 1) xx =reserved bit, read as 0 
2) R=read, W =write 
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Table 3-4. IF Bits and Functions 


Bit 
Name 
INTO 
INT1 
INT2 
INT3 
XINTO 
RINTO 
XINT1 
RINT1 
TINTO 
TINT1 
DINT 
DINTO 
DINT1 
ITTP 


Note: 


Reset 
Value Function 


0 External interrupt 0 flag 
External interrupt 1 flag 
External interrupt 2 flag 
External interrupt 3 flag 
Serial port 0 transmit flag 
Serial port 0 receive flag 


Serial port 1 transmit flag (‘C30 only) 


0 

0 

0 

0 

0 

0 

0 Serial port 1 receive interrupt flag (‘C30 only) 
0 Timer 0 interrupt flag 

0 Timer 1 interrupt flag 

0 DMA channel interrupt flag (‘C30 and ‘C31/VC33 only) 
0 DMAO channel interrupt flag (‘C32 only) 

0 DMA1 channel interrupt flag (‘C32 only) 

0 


Interrupt-trap table pointer (see Section 3.1.9.1) 
Allows the relocation of interrupt and trap vector tables (‘C32 only) 
If a load of the interrupt flag (IF) register occurs simultaneously with a set of a flag by an 


interrupt pulse, the loading of the flag has higher priority and overwrites the value of the 
interrupt flag register. 
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3.1.9.1 Interrupt-Trap Table Pointer (ITTP) cow! 


Similar to the rest of the ‘C3x device family, the 'C32’s reset vector location 
remains at address 0. However, the interrupt and trap vectors are relocatable. 
This is achieved by the interrupt-trap table pointer (ITTP) bit field in the CPU 
interrupt flag register, shown in Figure 3-9. The ITTP bit field dictates the 
starting location (base) of the interrupt-trap vector table. This base address 
is formed by left shifting by eight bits the value of the ITTP bit field. This shifted 
value is called the effective base address and is referenced as EA[ITTP], as 
shown in Figure 3-10. Therefore, the location of an interrupt or trap vector 
is given by the addition of the effective base address formed by the ITTP bit 
field (EA[ITTP]) and the offset of the interrupt or trap vector in the interrupt- 
trap vector table, as shown in Figure 3-11. For example, if the ITTP contains 
the value 100h, the serial port transmit interrupt vector is located at 10005h. 
Note that the vectors stored in the interrupt-trap vector table are the addresses 
of the start of the respective interrupt and trap routines. Furthermore, the 
interrupt-trap vector table must lie on a 256-word boundary, since the eight 
LSBs of the effective base address of the interrupt-trap vector table are 0. 


See Section 7.6, /nterrupts, on page 7-26 for more information on interrupt 
vector tables. 


Figure 3-10. Effective Base Address of the Interrupt-Trap Vector Table 


23 8 7 0 


EA[ITTP] = Bits 31-16 of the CPU interrupt flag register 00000000 
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Figure 3-11.Interrupt and Trap Vector Locations 


EA (ITTP) + 00h Reserved 


EA (ITTP) + 01h INTO 


EA (ITTP) + 02h INT1 


EA (ITTP) + 03h INT2 


EA (ITTP) + 04h INT3 


EA (ITTP) + 05h XINTO 


EA (ITTP) + 06h RINTO 


EA (ITTP) + 07h Reserved 


EA (ITTP) + 08h Reserved 


EA (ITTP) + 09h TINTO 


EA (ITTP) + OAh TINT1 


EA (ITTP) + OBh DINTO 


EA (ITTP) + 0Ch DINT1 


EA (ITTP) + 0Dh 


Reserved 


EA (ITTP) + 1Fh 


EA (ITTP) + 20h 


EA (ITTP) + 3Bh TRAP27 


EA (ITTP) + 3Ch TRAP28 (reserved) 


EA (ITTP) + 3Dh TRAP29 (reserved) 


EA (ITTP) + 3Eh TRAP30 (reserved) 


EA (ITTP) + 3Fh TRAP31 (reserved) 
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3.1.10 I/O Flag (IOF) Register 


The I/O flag (IOF) register is shown in Figure 3-12 and controls the function 
of the dedicated external pins, XFO and XF1. These pins can be configured for 
input or output. The pins can also be read from and written to. At reset, 0 is 
written to this register. Table 3-5 describes the I/O flags register bits, their 
names, and their functions. 


Figure 3-12. I/O Flag (IOF) Register 


31-16 | 1512 {| 11-8 | 7 6 5 4 | 3 2 1 0 
: \ , oR R/IW RW ; OR RW RW 
Notes: 1) xx =reserved bit, read as 0 
2) R=read, W = write 
Table 3-5. IOF Bits and Functions 
Reset 

Bit Name Value Function 
VOXFO 0 If 0, XFO is configured a general-purpose input pin. 


If 1, XFO is configured a general-purpose output pin. 


OUTXFO 0 Data output on XFO. 
INXFO 0 Data input on XFO. A write has no effect. 
VOXF1 0 If 0, XF1 is configured a general-purpose input pin. 


If 1, XF1 is configured a general-purpose output pin. 
OUTXF1 0 Data output on XF1. 


INXF1 0 Data input on XF1. A write has no effect. 
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3.1.11 Repeat-Counter (RC) and Block-Repeat (RS, RE) Registers 


The repeat-counter (RC) register is a 32-bit register that specifies the number 
of times a block of code is to be repeated when a block repeat is performed. 
If RC contains the number n, the loop is executed n + 1 times. 


The 32-bit repeat start-address (RS) register contains the starting address of 
the program-memory block to be repeated when the CPU is operating in the 
repeat mode. 


The 32-bit repeat end-address (RE) register contains the ending address of 
the program-memory block to be repeated when the CPU is operating in the 
repeat mode. 


SS SSS —S ————_—os—  — —————m—a 6 
Note: RE<RS 


If RE< RS and the block mode is enabled, the code between RE and RS is 
bypassed when the program counter encounters the repeat end (RE) ad- 


dress. 
 C.}Nn”",”!T——P>—.—>—E—EFEEFEeEeEemTzT eee] 
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Other Registers 


3.2 Other Registers 


3.2.1. Program-Counter (PC) Register 


The program counter (PC) is a 32-bit register containing the address of the 
next instruction fetch. While the program-counter register is not part of the 
CPU register file, it can be modified by instructions that modify the program 
flow. 


3.2.2 Instruction Register (IR) 


The instruction register (IR) is a 32-bit register that holds the instruction op- 
code during the decode phase of the instruction. This register is used by the 
instruction decode control circuitry and is not accessible to the CPU. 


Reserved Bits and Compatibility 


3.3. Reserved Bits and Compatibility 


To retain compatibility with future members of the ’C3x family of microprocessors, 
reserved bits that are read as 0 must be written as 0. You must not modify the 
current value of a reserved bit that has an undefined value. In other cases, you 
should maintain the reserved bits as specified. 
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Chapter 4 


Memory and the Instruction Cache 


The ’C3x provides a total memory space of 16M (million) 32-bit words that contain 
program, data, and I/O space. Two RAM blocks of 1K x 32 bits each (available 
on the C30 and ’C31) or two RAM blocks of 256 x 32 bits (available on the ’C32) 
and a ROM block of 4K x 32 bits (available only on the ’C30) or boot loader 
(available on the ’C31 and the ’C32) permit two CPU accesses in a single cycle. 
The VC33 contains four RAM blocks totaling 34K words plus an internal bootload- 
er identical to the C31 bootloader. 


A 64 x 32-bit instruction cache stores often-repeated sections of code, greatly 
reducing the number of off-chip accesses and allowing code to be stored off-chip 
in slower, lower-cost memories. 


| | 


Note: VC33 


The TMS320VC33 is logically and functionally a superset of the TMS320C31 
making nearly all literature and documentation for the TMS320C31 applica— 
ble. However, the electrical characteristics are different and are covered in 
detail in the TMS320VC33 data sheet (literature number SPRS087). 
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Memory 


4.1 Memory 


The 'C3x accesses a total memory space of 16M (million) 32-bit words of pro- 
gram, data, and I/O space and allows tables, coefficients, program code, or 
data to be stored in either RAM or ROM. In this way, you can maximize memory 
usage and allocate memory space as desired. 


RAM blocks 0 and 1 are each 1K x 32 bits on the 'C30 and ’C31/VC33. The 
ROM block is 4K x 32 bits on the ’C30. The ’C31/VC33 and ’C32 have a boot 
ROM. By manipulating one external pin (MC/MP or MCBL/MP), you can config- 
ure the first 1000h words of memory to address the on-chip ROM or external 
RAM. Each on-chip RAM and ROM block can support two CPU accesses in a 
single cycle. The separate program buses, data buses, and DMA buses allow for 
parallel program fetches, data reads/writes, and DMA operations, which are cov- 
ered in Chapter 11, Peripherals. 


4.1.1 Memory Maps 


The following sections describe the memory maps for the ’C30, ’C31/VC33, 
and ’C32. 


4.1.1.1 TMS320C30 Memory Map 
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The memory map depends on whether the processor is running in micro- 
processor mode (MC/MP = 0) or microcomputer mode (MC/MP = 1). The 
memory maps for these modes are similar (see Figure 4-1 on page 4-4). 
Locations 800000h-—801FFFh are mapped to the expansion bus. When this 
region is accessed, MSTRB is active. Locations 802000h—803FFFh are 
reserved. Locations 804000h—805FFFh are mapped to the expansion bus. 
When this region is accessed, IOSTRB is active. Locations 806000h- 
807FFFh are reserved. All of the memory-mapped peripheral bus registers 
are in locations 808000h—8097F Fh. In both modes, RAM block 0 is located 
at addresses 809800h-809BFFh, and RAM block 1 is located at addresses 
809C00h-809FFFh. Locations 80A000h-OFFFFFFh are accessed over the 
external memory port (STRB active). 


Lj Microprocessor Mode 


In microprocessor mode, the 4K on-chip ROM is not mapped into the ’C3x 
memory map. Locations Oh-03Fh consist of interrupt vector, trap vector, 
and reserved locations, all of which are accessed over the external memory 
port (STRB active) (see Figure4-1 on page 4-4). Locations 
040h-7FFFFFh are also accessed over the external memory port. 


Memory 


[J Microcomputer Mode 


In microcomputer mode, the 4K on-chip ROM is mapped into locations 
Oh-OFFFh. There are 192 locations (Oh-OBFh) within this block for interrupt 
vectors, trap vectors, and a reserved space (’'C30). Locations 1000h-— 
7FFFFFh are accessed over the external memory port (STRB active). 


Section 4.1.2, Peripheral Bus Memory Map, on page 4-9 describes the peripheral 
memory maps in greater detail and Section 4.2, Resei/Interrupt/Trap Vector Map, 
on page 4-14 provides the vector locations for reset, interrupts, and traps. 


Be careful! Access to a reserved area produces unpredictable 
results. 
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Figure 4—1. TMS320C30 Memory Maps 
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7FFFFFh 
800000h 


801FFFh 
802000h 


803FFFh 
804000h 


805FFFh 
806000h 


807FFFh 
808000h 


8097FFh 
809800h 


809BFFh 
809C00h 


809FFFh 
80A000h 


FFFFFFh 


On Reset, interrupt, trap vectors, 
and reserved locations (64) 
03Fh (external STRB active) 
040h 


External 
STRB active 
(8.192M words) 


Expansion bus 
MSTRB active 
(8K words) 


Reserved 
(8K words) 


Expansion bus 
IOSTRB active 


(8K words) 


Reserved 
(8K words) 


Peripheral bus 
memory-mapped 
registers 
(6K words internal) 


RAM block 0 
(1K words internal) 


RAM block 1 
(1K words internal) 


External 
STRB active 
(7.96M words) 


Microprocessor mode 


Oh 


OBFh 
o0COh 


OFFFh 
1000h 


7FFFFFh 
800000h 


801FFFh 
802000h 


803FFFh 
804000h 


805FFFh 
806000h 


807FFFh 
808000h 


8097FFh 
809800h 


809BFFh 
809C00h 


809FFFh 
80A000h 


FFFFFFh 


Reset, interrupt, trap vectors, 
and reserved locations (192) 


(Internal) 


External 
STRB active 
(8.188M words) 


Expansion bus 
MSTRB active 
(8K words) 


Reserved 
(8K words) 


Expansion bus 


IOSTRB active 
(8K words) 


Reserved 
(8K words) 


Peripheral bus 
memory-mapped 
registers 
(Internal) 

(6K words internal) 


RAM block 0 
(1K words internal) 


RAM block 1 
(1K words internal) 


External 
STRB active 
(7.96M words) 


Microcomputer mode 


Memory 


4.1.1.2 TMS320C31/VC33 Memory Map 


The memory map depends on whether the processor is running in micropro- 
cessor mode (MCBL/MP = 0) or microcomputer mode (MCBL/MP = 1). The 
memory maps for these modes are similar (see Figure 4-2 on page 4-6). 
Locations 800000h-807FFFh are reserved. All of the memory-mapped 
peripheral bus registers are in locations 808000h-8097F Fh. In both modes, 
RAM block 0 is located at addresses 809800h-809BFFh, and RAM block 1 is 
located at addresses 809CO00h-809FFFh. Locations 80A000h-OFFFFFFh 
are accessed over the external memory port (STRB active). 


_J Microprocessor Mode 


In microprocessor mode, the boot loader is not mapped into the ’‘C3x 
memory map. Locations Oh-03Fh consist of interrupt vector, trap vector, 
and reserved locations, all of which are accessed over the external 
memory port (STRB active) (see Figure 4-2 on page 4-6). Locations 
040h-—7FFFFFh are also accessed over the external memory port. 


_) Microcomputer Mode 


In microcomputer mode, the boot loader ROM is mapped into locations 
Oh-OFFFh. The last 63 words (809FC1h to 809FFFh) of internal RAM 
Block 1 are used for interrupt and trap branches (see Figure 4—2 on page 
4-6). Locations 1000h—-7FFFFFh are accessed over the external 
memory port (STRB active). 


Section 4.1.2, Peripheral Bus Memory Map, on page 4-9 describes the 
peripheral memory maps in greater detail and Section 4.2, Reset/Interrupt/ 
Trap Vector Map, on page 4-14 provides the vector locations for reset, inter- 
rupts, and traps. 


Be careful! Access to a reserved area produces unpredictable 
results. 
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Figure 4-2. TMS320C31/VC33 Memory Maps 


Oh 5 Oh 
Reset, interrupt, trap vectors, 
and reserved locations (64) 
03Fh (external STRB active) Reserved for boot- 
040h loader operationst 
OFFFh 
1000h 
External External 
STRB active STRB 
(8.192M words) active 
(8.188M words) 
400000h 
7FFFFFh 7FFFFFh 
800000h 800000h 
Reserved Reserved 
(32K words) (32K words) 
807FFFh 807FFFh 
808000h 808000h 


Peripheral bus 
memory-mapped 
registers 
(6K words internal) 


Peripheral bus 
memory-mapped 
registers 
(6K words internal) 


8097FFh 8097FFh 
809800h 809800h 
RAM block 0 RAM block 0 
(1K words internal) (1K words internal) 
809BFFh 809BFFh 
809C00h 809C00h 


809FFFh 
80A000h 


FFFFFFh 


RAM block 1 
(1K words internal) 


External 
STRB active 
(7.96M words) 


809FCOh 
809FCth 


809FFFh 
80A000h 


FFFOOOh 


FFFFFFh 


RAM block 1 
(1K — 63 words internal) 


User program interrupt 
and trap branches 
(63 words internal) 


External 
STRB 
active 

(7.96M words) 


Microprocessor mode Microcomputer/boot-loader mode 


T See Section 3.1 .3, Data-Page Pointer (DP), on page 3-4 for more information. 
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4.1.1.3. TMS320C32 Memory Map cow! 


The memory map depends on whether the processor is running in micropro- 
cessor mode (MCBL/MP = 0) or microcomputer mode (MCBL/MP = 1). The 
memory maps for these modes are similar (see Figure 4-3 on page 4-8). 
Locations 800000h-807FFFh, 809800h-80FFFh, and 8830000H-87FDFFh are 
reserved. Locations 810000h—82FFFFh are mapped to the external bus with 
IOSTRB active. All of the memory-mapped peripheral bus registers are in loca- 
tions 808000h-8097FFh. In both modes, RAM block 0 is located at addresses 
87FE00h-87FEFFh, and RAM block 1 is located at addresses 87FFOOh-— 
87FFFFh. Locations 900000h-FFFFFFh are mapped to the external bus with 
STRB1 active. 


Unlike the fixed interrupt-trap vector table location of the ‘C30 and ’C31/VC33 
devices, the ’C32 has a user-relocatable interrupt-trap vector table. The inter- 
rupt-trap vector table must start on a 256-word boundary. The starting location 
is programmed through the interrupt-trap table pointer (ITTP) bit field in the CPU 
interrupt flag (IF) register. See Section 3.1.9.1, /nterrupt-Trap Table Pointer 
(ITTP), on page 3-14. 


_J Microprocessor Mode 


In microprocessor mode, the boot loader is not mapped into the ‘C3x memory 
map. Locations Oh-7FFFFFFh are accessed over the external memory port 
(STRBO active) with location Oh containing the reset vector. 


_) Microcomputer Mode 


In microcomputer mode, the on-chip boot loader ROM is mapped into 
locations Oh-OFFFh. Locations 1000h—-7FFFFFh are accessed over the 
external memory port (STRBO active). 


The C32 boot loader has additional modes over the ’'C31/VC33 boot loader to han- 
dle the data types, sizes, and memory widths supported by the external memory 
interface. The memory boot load supports data transfer with and without hand- 
shaking. The handshake mode allows synchronous program transfer by using 
two pins as data-acknowledge and data-ready signals. 


See Section 4.1.2, Peripheral Bus Memory Map, on page 4-9 and Section 4.2, 
Resev/Interrupt/Trap Vector Map, on page 4-14 for more information. 


Be careful! Access to a reserved area produces unpredictable 
results. 
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Figure 4-3. TMS320C32 Memory Maps 


Oh Oh 
Reset-vector location Reserved for 
boot-loader operations 
OFFFh 
1000h Boot 1 


External memory 100th 


STRBO active 
(8.192M words) 


External memory 
STRBO active 
(8.188M words) 


7FFFFFh 7FFFFFh 
800000h 800000h 
Reserved Reserved 
(32K words) (32K words) 
807FFFh 807FFFh 
808000h Peripheral bus 808000h Peripheral bus 
memory-mapped registers memory-mapped registers 
8097FFh (6K words internal) 8097FFh (6K words internal) 
809800h 809800h 
Reserved Reserved 
(26K words) (26K words) 
80FFFFh 80FFFFh 
810000h 810000h Boot 2 


810001h 


External memory 
IOSTRB active (128K) 
(128K words) 


External memory 
IOSTRB active (128k) 
(128K words) 


82FFFFh 82FFFFh 
830000h Reserved 830000h Reserved 
87FDFFh (319.5K words) 87FDFFh (319.5K words) 
87FE00h RAM block 0 87FE00h RAM block 0 
87FEFFh (256 words internal) 87FEFFh (256 words internal) 
87FFOOh RAM block 1 87FFOOh RAM block 1 
87FFFFh (256 words internal) 87FFFFh (256 words internal) 
880000h 880000h 
External memory External memory 
STRBO active STRBO active 
(512K words) (512K words) 
8FFFFFh BFFFFFh 
900000h 900000h Boot 3 


External memory 900001h 
STRB1 active 
(7.168M words) 


External memory 
STRB1 active 
(7.168M words) 


FFFFFFh FFFFFFh 


Microprocessor mode Microcomputer/boot-loadermode 
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4.1.2 Peripheral Bus Memory Map 


The following sections describe the peripherial bus memory maps for the ’C30, 
°C31/VC33, and ’C32. 


4.1.2.1_| TMS320C30 Peripheral Bus Memory Map 


The ’C30 memory-mapped peripheral registers are located starting at address 
808000h. Figure 4—4 on page 4-10 shows the peripheral bus memory map. The 
shaded blocks are reserved. 
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Figure 4-4. TMS320C30 Peripheral Bus Memory-Mapped Registers 


808000h DMA global control 


808004h DMA source address 


808006h DMA destination address 


808008h DMA transfer counter 


808020h Timer 0 global control 


808024h Timer 0 counter 


808028h Timer 0 period 


808030h Timer 1 global control 


808034h Timer 1 counter 


808038h Timer 1 period register 


808040h Serial port 0 global control 


808042h | FSX/DX/CLKX serial port 0 control 
808043h | FSR/DR/CLKR serial port 0 control 
808044h Serial port 0 R/X timer control 
808045h Serial port 0 R/X timer counter 
808046h Serial port 0 R/X timer period 


808048h Serial port 0 data transmit 


80804Ch Serial port 0 data receive 


808050h Serial port 1 global control 


808052h | FSX/DX/CLKX serial port 1 control 
808053h | FSR/DR/CLKR serial port 1 control 
808054h Serial port 1 R/X timer control 
808055h Serial port 1 R/X timer counter 
808056h Serial port 1 R/X timer period 


808058h Serial port 1 data transmit 


80805Ch Serial port 1 data receive 


808060h Expansion-bus control 


808064h Primary-buscontrol 
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4.1.2.2 TMS320C31/VC33 Peripheral Bus Memory Map 


The ’C31/VC33 memory-mapped peripheral registers are located starting at 
address 808000h. Figure 4—5 shows the peripheral bus memory map. The 
shaded blocks are reserved. 


Figure 4-5. TMS320C31/VC33 Peripheral Bus Memory-Mapped Registers 


808000h DMA global control 


808004h DMA source address 


808006h DMA destination address 


808008h DMA transfer counter 


808020h Timer 0 global control 


808024h Timer 0 counter 


808028h Timer 0 period 


808030h Timer 1 global control 


808034h Timer 1 counter 


808038h Timer 1 period register 


808040h Serial port global control 


808042h | FSX/DX/CLKX serial port control 
808043h | FSR/DR/CLKR serial port control 
808044h Serial port R/X timer control 
808045h Serial port R/X timer counter 
808046h Serial port R/X timer period 


808048h Serial port data transmit 


80804Ch Serial port data receive 


808064h Primary-bus control 
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4.1.2.3 TMS320C32 Peripheral Bus Memory Map cow! 


The ’C32’s memory-mapped peripheral and external-bus control registers are 
located starting at address 808000h, as shown in Figure 4-6 on page 4-13. The 
shaded blocks are reserved. 
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Figure 4-6. TMS320C32 Peripheral Bus Memory-Mapped Registers 


808000h DMA 0 global control 


808004h DMA 0 source address 


808006h DMA 0 destination address 


808008h DMA 0 transfer counter 
808010h DMA 1 global control 
808014h DMA 1 source address 


808016h DMA 1 destination address 


808018h DMA 1 transfer counter 
808020h Timer 0 global control 
808024h Timer 0 counter 
808028h Timer 0 period 
808030h Timer 1 global control 
808034h Timer 1 counter 
808038h Timer 1 period register 


808040h Serial port global control 


808042h | FSX/DX/CLKX serial port control 


808043h | FSR/DR/CLKR serial port control 


808044h Serial port R/X timer control 
808045h Serial port R/X timer counter 


808046h Serial port R/X timer period 


808048h Serial port data transmit 


80804Ch Serial port data receive 
808060h IOSTRB bus control 
808064h STRBO bus control 
808068h STRB1 bus control 
8097FFh 
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Reset/Interrupt/Trap Vector Map 


4.2 Reset/Interrupt/Trap Vector Map 


el 
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The addresses for the reset, interrupt, and trap vectors are OOh-3Fh, as shown 
in Figure 4-7 and Figure 4-8. The reset vector contains the address of the reset 
routine. 


L) 


°C30 and ’C31/VC33 Microprocessor and Microcomputer Modes 


In the microprocessor mode of the ’C30 and ’C31/VC33 and the micro- 
computer mode of the ’C30, the reset interrupt and trap vectors stored in 
locations Oh-3Fh are the addresses of the starts of the respective reset, 
interrupt, and trap routines. For example, at reset, the content of memory 
location 00h (reset vector) is loaded into the PC, and execution begins 
from that address (see Figure 4-8 on page 4-16). 


°C31/VC33 Microcomputer/Boot-Loader Mode 


In the microcomputer/boot-loader mode of the ’C31/VC33, the interrupt 
and trap vectors stored in locations 809FC1h-809FFFh are branch in- 
structions to the start of the respective interrupt and trap routines (see 
Figure 4-9 on page 4-17). 


*C32 Microprocessor and Microcomputer/Boot-Loader Mode 


The ’C32 has a user-relocatable interrupt-trap vector table. The interrupt- 
trap vector table must start on a 256-word boundary. The starting location is 
programmed through the interrupt-trap table pointer (ITTP) bit field in the 
CPU interrupt flag (IF) register. See Section 3.1.9.1, /nterrupt-Trap Table 
Pointer (ITTP), on page 3-14. The reset vector is stored at location Oh in 
microprocessor mode. 


Reset/Interrupt/Trap Vector Map 


Figure 4—7. Reset, Interrupt, and Trap Vector Locations for the TMS320C30 
Microprocessor Mode 


0Ch 
Reserved 
1Fh 


20h TRAP 0 


Note: Traps 28-31 


Traps 28-31 are reserved; do not use them. 
| | 
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Reset/Interrupt/Trap Vector Map 


Figure 4—8. Reset, Interrupt, and Trap Vector Locations for theTMS320C31/VC33 
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Microprocessor Mode 


OCh 
Reserved 
1Fh 


20h 


3Ch TRAP 28 (reserved 


) 
3Dh TRAP 29 (reserved) 
) 
) 


3Eh TRAP 30 (reserved 


3Fh TRAP 31 (reserved 


Note: Traps 28-31 


Traps 28-31 are reserved; do not use them. 
| 


Reset/Interrupt/Trap Vector Map 


Figure 4-9. Interrupt and Trap Branch Instructions for the TMS320C31/VC33 
Microcomputer Mode 


809FCCh 
809FE0h 
809FE1h 


809FFFh TRAP 31 (reserved) 


Te | 


Note: Traps 28-31 
Traps 28-31 are reserved; do not use them. 


Unlike the ’C31/VC33’s microprocessor mode, the ’'C31/VC33 microcomput- 

er/boot loader mode uses a dual-vectoring scheme to service interrupts and 

trap requests. In this dual vectoring scheme, a branch instruction rather than 

a vector address is used. 
a) 
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Reset/Interrupt/Trap Vector Map 


Figure 4-10. Interrupt and Trap Vector Locations for TMS320C32 


EA (ITTP) + 00h Reserved 


EA (ITTP) + 01h INTO 


EA (ITTP) + 02h INT1 


EA (ITTP) + 03h INT2 


EA (ITTP) + 04h INT3 


EA (ITTP) + 05h XINTO 


EA (ITTP) + 06h RINTO 


EA (ITTP) + 07h Reserved 


EA (ITTP) + 08h Reserved 


EA (ITTP) + 09h TINTO 


EA (ITTP) + OAh TINT1 


EA (ITTP) + OBh DINTO 


EA (ITTP) + 0Ch DINT1 


EA (ITTP) + 0Dh 


Reserved 


EA (ITTP) +1Fh 


EA (ITTP) + 20h 


EA (ITTP) + 3Bh TRAP27 


EA (ITTP) + 3Ch TRAP28 (reserved) 


EA (ITTP) + 3Dh TRAP29 (reserved) 


EA (ITTP) + 3Eh TRAP30 (reserved) 


EA (ITTP) + 3Fh TRAP31 (reserved) 


a a aT a | 


Note: Traps 28-31 


Traps 28-31 are reserved; do not use them. 
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4.3 Instruction Cache 


A 64 x 32-bit instruction cache speeds instruction fetches and lowers system 
cost by caching program fetches from external memory. The instruction cache 
allows the use of slow, external memories while still achieving single-cycle access 
performances. This reduces the number of off-chip accesses necessary and 
allows code to be stored off-chip in slower, lower-cost memories. The cache 
also frees external buses from program fetches so that they can be used by 
the DMA or other system elements. 


The cache can operate automatically, with no user intervention. Subsection 
4.3.2 describes a form of the least recently used (LRU) cache update algorithm. 


4.3.1. Instruction-Cache Architecture 


The instruction cache (see Figure 4-12) contains 64 32-bit words of RAM; it 
is divided into two 32-word segments. A 19-bit segment start address (SSA) 
register is associated with each segment. For each word in the cache, there 
is a corresponding single bit-present (P) flag. 


When the CPU requests an instruction word from external memory, the cache 
algorithm checks to determine if the word is already contained in the instruction 
cache. Figure 4-11 shows how the cache-control algorithm partitions an 
instruction address. The algorithm uses the19 most significant bits (MSBs) of 
the instruction address to select the segment; the five least significant bits 
(LSBs) define the address of the instruction word within the pertinent segment. 
The algorithm compares the 19 MSBs of the instruction address with the two 
SSA registers. If there is a match, the algorithm checks the relevant P flag. The 
P flag indicates if a word within a particular segment is already present in cache 
memory: 


_} P= 1: the word is already present in cache memory 


_j P=0: the location cache is invalid 


Figure 4—11.Address Partitioning for Cache Control Algorithm 
23 5 4 0 


Segment start address Instruction word 
(SSA) address within segment 


If there is no match, one of the segments must be replaced by the new data. The 
segment replaced in this circumstance is determined by the LRU algorithm. The 
LRU stack (see Figure 4—12) is maintained for this purpose. 
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Instruction Cache 


Figure 4-12. Instruction-Cache Architecture 


Segment start P 
address registers flags Segment words LRU 


A A Stack 
SSA register 0 | 0 | Segment word 0 
19 >| Segment word 1 


MRU segment number 


i LRU segment number 
. : Segment 0 
Segment word 30 
Segment word 31 
le 32 >| 
SSA register 1 } 0 | Segment word 0 
Segment word 1 
: : Segment 1 


The LRU stack determines which of the two segments qualifies as the least 
recently used after each access to the cache. Each time a segment is accessed, 
its segment number is removed from the LRU stack and pushed onto the top 
of the LRU stack. Therefore, the number at the top of the stack is the most re- 


cently used (MRU) segment number, and the number at the bottom of the stack 
is the least recently used segment number. 


At reset, the LRU stack is initialized with 0 at the top and 1 at the bottom. All 
P flags in the instruction cache are cleared. 


When a replacement is necessary, the LRU segment is selected for replace- 
ment. Also, the 32 P flags for the segment to be replaced are set to 0, and the 
segment’s SSA register is replaced with the 19 MSBs of the instruction address. 
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4.3.2 Instruction-Cache Algorithm 


When the ’C3x requests an instruction word from external memory, one of two 
possible actions occurs: a cache hit or a cache miss. 


Lj 


Cache Hit. The cache contains the requested instruction, and the following 
actions occur: 


m The instruction word is read from the cache. 


m The number of the segment containing the word is removed from the 
LRU stack and pushed to the top of the LRU stack (if it is not already at 
the top), thus moving the other segment number to the bottom of the 
stack. 


Cache Miss. The cache does not contain the instruction. There are two 
types of cache misses: 


m Subsegment miss. The segment address register matches the instruc- 
tion address, but the relevant P flag is not set. The following actions 
occur in parallel: 


The instruction word is read from memory and copied into the cache. 


The number of the segment containing the word is removed from 
the LRU stack and pushed to the top of the LRU stack (if it is not 
already at the top), thus moving the other segment number to the 
bottom of the stack. 


The relevant P flag is set. 


m Segment miss. Neither of the segment addresses matches the instruc- 
tion address. The following actions occur in parallel: 


The LRU segment is selected for replacement. The P flags for all 
32 words are cleared. 


The SSA register for the selected segment is loaded with the 
19 MSBs of the address of the requested instruction word. 


The instruction word is fetched and copied into the cache. It goes 
into the appropriate word of the LRU segment. The P flag for that 
word is set to 1. 


The number of the segment containing the instruction word is 
removed from the LRU stack and pushed to the top of the LRU 
stack, thus moving the other segment number to the bottom of 
the stack. 
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Instruction Cache 


Only instructions may be fetched from the program cache. All reads and writes 
of data in memory bypass the cache. Program fetches from internal memory 
do not modify the cache and do not generate cache hits or misses. The pro- 
gram cache is a single-access memory block. Dummy program fetches (for 
example, those following a branch) are treated by the cache as valid program 
fetches and can generate cache misses and cache updates. 


EF ———— — << — ccc —  —— a ——_ i 
Notes: Using Self-Modifying Code 


Be careful when using self-modifying code. If an instruction resides in the 
cache and the corresponding location in primary memory is modified, the 
copy of the instruction in the cache is not modified. 


You can use the cache more efficiently by aligning program code on 32-word 
address boundaries. Do this with the .align directive when coding assembly 


language. 
| 


4.3.3. Cache Control Bits 
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Three cache control bits are located in the CPU status register: 


(J Cache Clear Bit (CC). Set CC = 1 to invalidate all entries in the cache. This 
bit is always cleared after it is written to; it is always read as a 0. At reset, 
the cache is cleared, and 0 is written to this bit. 


(1 Cache Enable Bit (CE). Set CE = 1 to enable the cache, allowing the 
cache to be used according to the LRU cache algorithm. Set CE = 0 to dis- 
able the cache; this prevents cache update modifications (thus, no cache 
fetches can be made). At reset, 0 is written to this bit. Cache clearing (CC 
= 1) is allowed when CE = 0. 


(J Cache Freeze Bit (CF). Set CF = 1 to freeze both the cache and LRU 
stack manipulation. If the cache is enabled (CE = 1) and the cache is 
frozen (CF = 1), fetches from the cache are allowed, but modification of 
cache contents is not allowed. Cache clearing (CC = 1) is allowed when 
CF =1 or CF = 0. At reset, this CF bit is cleared to 0. 


Table 4—1 shows the combined effect of the CE and CF. 
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Table 4-1. Combined Effect of the CE and CF Bits 


CE CF Effect 
0 0 Cache not enabled 
0 1 Cache not enabled 
1 0 Cache enabled and not frozen 
1 1 Cache enabled and frozen 


When the CE or CF bits of the CPU status register are modified, the following 
four instructions may or may not be fetched from the cache or external memory 
(see Example 4-1). 


When the CC bit of the CPU status register is modified, the following five instruc- 
tions may or may not be fetched from the cache before the cache is cleared (see 
Example 4-1). 


Example 4-1. Pipeline Effects of Modifying the Cache Control Bits 


Pipeline Operation 


Cycle | Fetch Decode | Read | Execute 
Instructions may 
n LDI 1000h, ST 
an be fetched before 
il cache is enabled 

n+1 LDI 1h, R1 LDI 1000h, ST or frozen. 

n+2 LDI 2h, R2 LDI 1h, R1 LDI 1000h, ST Cache cleared 

n+3 LDI 3h, R3 LDI 2h, R2 LDI 1h, R1 LDI 1000h, ST# 
Instructions may 
be fetched before | 4,4 LDI 4h, R4 LDI 3h, R3 LDI 2h, R2 LDI ih, R1 
cache cleared. za 

n+5 | KDI 5h, R5 LDI 4h, R4 LDI 3h, R3 LDI 2h, R2 

n+6 LDI 5h, R5 LDI 4h, R4 LDI 3h, R3 

n+7 LDI 5h, R5 LDI 4h, R4 

n+8 LDI Sh, R5 
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Chapter 5 


Data Formats and Floating-Point Operation 


In the ’C3x architecture, data is organized into three fundamental types: integer, 
unsigned integer, and floating-point. The terms integer and signed integer are 
equivalent. The ‘C3x supports short and single-precision formats for signed and 
unsigned integers. It also supports short, single-precision, and extended- 
precision formats for floating-point data. 


Floating-point operations make fast, trouble-free, accurate, and precise com- 
putations. Specifically, the ‘C3x implementation of floating-point arithmetic facili- 
tates floating-point operations at integer speeds, while preventing problems with 
overflow, operand alignment, and other burdensome tasks that are common in 
integer operations. 


This chapter discusses data formats and floating-point operations supported 


in the ’C3x. 
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Integer Formats 


5.1. Integer Formats 


The ’C3x supports two integer formats: a 16-bit short-integer format and a 
32-bit single-precision integer format. 


Note: 


When extended-precision registers are used as integer operands, only bits 
31-0 are used; bits 39-32 remain unchanged. 


5.1.1 Short-Integer Format 


The short-integer format is a 16-bit 2s-complement integer format for immediate- 
integer operands. For those instructions that assume integer operands, this 
format is sign-extended to 32 bits (see Figure 5-1). The range of an integer 
si, represented in the short-integer format, is -215< si< 215 ~ 1. In Figure 5-1, 
S = signed bit. 


Figure 5-1. Short-Integer Format and Sign-Extension of Short Integers 
15 


oO 


Short-integer format 


31 16 15 0 


Sign-extension of a short integer 


5.1.2 Single-Precision Integer Format 


In the single-precision integer format, the integer is represented in 2s-comple- 
ment notation. The range of an integer sp, represented in the single-precision 
integer format, is -231 < sp < 231 - 1. Figure 5-2 shows the single-precision 
integer format. 


Figure 5-2. Single-Precision Integer Format 
31 


Oo 
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Unsigned-Integer Formats 


5.2 Unsigned-Integer Formats 
The ’C3x supports two unsigned-integer formats: a 16-bit short format and a 
32-bit single-precision format. 


ccs | 
Note: 


In extended-precision registers, the unsigned-integer operands use only bits 
31-0; bits 39-32 remain unchanged. 


| | 
5.2.1. Short Unsigned-Integer Format 


Figure 5-3 shows the16-bit, short, unsigned-integer format for immediate 
unsigned-integer operands. For those instructions which assume unsigned- 
integer operands, this format is zero filled to 32 bits. The range of a short 
unsigned integer is 0< si< 216. 


Figure 5-3. Short Unsigned-Integer Format and Zero Fill 
15 0 


SSS Ss) 


Short unsigned-integer format 


31 16 15 0 


Zero fill of a short unsigned integer 


5.2.2 Single-Precision Unsigned-Integer Format 


In the single-precision unsigned-integer format, the number is represented as 
a 32-bit value, as shown in Figure 5-4. The range of a single-precision unsigned 
integer is 0 < sp < 282. 


Figure 5-4. Single-Precision Unsigned-Integer Format 
31 0 


po 
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Floating-Point Formats 


5.3 Floating-Point Formats 


The ’C3x supports four floating-point formats: 


Li Ashort floating-point format for immediate floating-point operands, consisting 
of a 4-bit exponent, a sign bit, and an 11-bit fraction 


(1 (C82 only) A short floating-point format for use with 16-bit floating-point 
data types, consisting of a 2s-complement, 8-bit exponent field, a sign bit, 
and a 7-bit fraction 


1] Asingle-precision floating-point format by an 8-bit exponent field, a sign 
bit, and a 23-bit fraction 


[J An extended-precision floating-point format consisting of an 8-bit exponent 
field, a sign bit, and a 31-bit fraction. 


All ’C3x floating-point formats consist of three fields: an exponent field (e), a 
single-bit sign field (s), and a fraction field (f). The sign field and fraction field 
may be considered as one unit and referred to as the mantissa field (man). 


Figure 5-5. General Floating-Point Format 
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Ie Mantissa >| 


The general equation for calculating the value in a floating-point number is: 


X = SS.f, X 2° 


In the equation, s is the value of the sign bit, s is the inverse of the value of the 
sign bit, fis the binary value of the fraction field, and e is the decimal equivalent 
of the exponent field. 


The mantissa represents a normalized 2s-complement number. In a normalized 
representation, a most significant nonsign bit is implied, thus providing an addi- 
tional bit of precision. The implied sign bit is used as follows: 


Lj If s=0, then the leading two bits of the mantissa are 01. 
Lj If s=1, then the leading two bits of the mantissa are 10. 


If the sign bit, s, is equal to 0, the mantissa becomes 01.fo, where fis the binary 
representation of the fraction field. If s is 1, the mantissa becomes 10.fo, where f 
is the binary representation of the fraction field. 


For example, if f = 000000000015 and s = 0, the value of the mantissa (man) 
is 01.00000000001>. If s = 1 for the same value of f, the value of manis 
10.00000000001>5. 


Floating-Point Formats 


The exponent field is a 2s-complement number that determines the factor of 2 
by which the number is multiplied. Essentially, the exponent field shifts the 
binary point in the mantissa. If the exponent is positive, then the binary point is 
shifted to the right. If the exponent is negative, then the binary point is shifted 
to the left. 


For example, if man = 01.00000000001o and the e = 1149, then the binary point 
is shifted 11 places to the right, producing the number: 01000000000015, 
which is equal to 2049 decimal. 


5.3.1. Short Floating-Point Format 


In the short floating-point format, floating-point numbers are represented by a 
2s-complement, 4-bit exponent field (e) and a 2s-complement, 12-bit mantissa 
field (man) with an implied most significant nonsign bit (see Figure 5-6). 


Figure 5-6. Short Floating-Point Format 


15 12 11 10 0 
I< Mantissa > 


Operations are performed with an implied binary point between bits 11 and 10. 
When the implied most significant nonsign bit is made explicit, it is located to the 
immediate left of the binary point. The floating-point 2s-complement number x 
in the short floating-point format is given by the following: 


xX = 01.6 2¢ ifs=0 
x = 10.fx 2¢ ifs=1 
x=0 ife=-8 


You must use the following reserved values to represent 0 in the short floating- 
point format: 


e=-8 
s= 0 
f= 0 
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Floating-Point Formats 


The following examples illustrate the range and precision of the short floating- 
point format: 


Most positive: x= (2-27-11) x27 =2.5594 x 102 
Least positive: x=1x2-7=7.8125x 10-3 
Least negative: x= (-1-2-11) x 2-7 = -7.8163 x 10-3 
Most negative: x=-2 x27 =-2.5600 x 102 
5.3.2 TMS320C32 Short Floating-Point Format for External 16-Bit Data sow! 


To facilitate the handling of 16-bit floating-point data types, the ‘C32 uses a new 
short floating-point format for external 16-bit data types. Note that the following 
short floating-point format is used only in external 16-bit floating-point data 
access. This format is different than the 16-bit immediate short floating-point 
data format used in the ‘C32’s instruction set. 


In the short floating-point format for external 16-bit data-type size, floating-point 
numbers are represented by a 2s-complement, 8-bit exponent field (e), a sign bit 
(s), and an 8-bit mantissa field (man) with an implied most significant nonsign bit. 


Figure 5—7. TMS320C32 Short Floating-Point Format for External 16-Bit Data 
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15 8 76 0 


ne Mantissa gl 


Operations are performed with an implied binary point between bits 7 and 6. 
When the implied most significant nonsign bit is made explicit, it is located to the 
immediate left of the binary point. The floating-point 2s-complement number x 
in the short floating-point format is given by: 


xX = 01fx2¢ ifs=0 
xX = 10.4 x 2¢ ifs=1 
x =0 if e=-128 


You must use the following reserved values to represent 0 in the ‘C32 short 
floating-point format for external 16-bit data: 


e = -128 
s=0 
f =0 


Floating-Point Formats 


The following examples illustrate the range and precision of the ‘C32 short 
floating-point format for external 16-bit data: 


= (2-2-8) x 2127 — 3.3961775 x 1038 

=1 x 2-127 — 5.8774717541 x 10-39 

= (-1-2-8) x 2-127 — -5.9004306 x 10-39 
= (-2 x 2127) = -3.4028236 x 1038 


Most positive: 
Least positive 
Least negative: 
Most negative: 


Note that the floating-point instructions (such as LDF, MPYF, ADDF) and the 
integer instructions (such as LDI, MPYI, ADDI) produce different results when 
accessing the same memory location. The integer load instructions store the 
value in the LSBs of the ‘C32’s registers. A bit field in the strobe control register 
controls sign extension or zero fill of the MSBs of the integer value. On the other 
hand, the floating-point load instructions store the value in the MSBs of the 
‘C32’s registers. For example: 


If AR1 = 4000h, R1 = 00 00000000h, the value stored at memory location 
4000h is 0180h, and STRBO is configured for a physical memory size and data 
type size of 16 bits. 


The result of: ADDI*AR1,R1_ is R1 = 00 00000180h, while 


The result of: ADDF *“AR1,R1 is R1 = 01 COQ000000h (= — 3.0), since 
-4.0+1.0=-3.0 


5.3.3 Single-Precision Floating-Point Format 


In the single-precision format, the floating-point number is represented by an 
8-bit exponent field (e) and a 2s-complement 24-bit mantissa field (man) with 
an implied most significant nonsign bit (see Figure 5-8). 


Figure 5-8. Single-Precision Floating-Point Format 


31 2423 22 0 
| J Mantissa >| 


Operations are performed with an implied binary point between bits 23 and 22. 
When the implied most significant nonsign bit is made explicit, it is located to 
the immediate left of the binary point. The floating-point number x is given by 


the following: 

xX = 01.fx2¢ ifs=0 

x = 10.fx 2¢ ifs=1 
x=0 ife=-128 
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Floating-Point Formats 


You must use the following reserved values to represent 0 in the single-precision 
floating-point format: 


e = - 128 
s=0 
f =0 


The following examples illustrate the range and precision of the single-precision 
floating-point format: 


Most positive: = (2 — 2-23) x 2127 = 3.4028234 x 1038 
= 1 x 2-127 = 5.8774717 x 10-39 
= (-1-2-28) x 2-127 = _5.8774724 x 10-89 


= —2 x 2127 = _3.4028236 x 1038 


Least positive: 
Least negative: 


<x «<— &— Xx 


Most negative: 


5.3.4 Extended-Precision Floating-Point Format 


In the extended-precision format, the floating-point number is represented by 
an 8-bit exponent field (e€) and a 32-bit mantissa field (man) with an implied 
most significant nonsign bit (see Figure 5-9). 


Figure 5-9. Extended-Precision Floating-Point Format 
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39 32 31 30 0 
« Mantissa P 


Operations are performed with an implied binary point between bits 31 and 30. 
When the implied most significant nonsign bit is made explicit, it is located to 
the immediate left of the binary point. The floating-point number x is given by 
the following: 


X = 01.f x 2e ifs=0 
X =10.f x 2° ifs=1 
X= if e = -128 


You must use the following reserved values to represent 0 in the extended- 
precision floating-point format: 


e = -128 
s=0 
f =0 


Floating-Point Formats 


The following examples illustrate the range and precision of the extended- 
precision floating-point format: 

Most positive: X = (2 - 2-23) x 2127 = 3.4028234 x 1038 

Least positive: x=1x 2-127 = 5.8774717541 x 1038 

Least negative: x= (-1-2-31) x 2-127 = -5.8774717569 x 10-89 

Most negative: x=—-2x 2127 =-3.4028236691 x 1038 


5.3.5 Determining the Decimal Equivalent of a TMS320C3x Floating-Point Format 


To convert a ‘C3x floating-point number to its decimal equivalent, follow these 
steps: 


Step 1: Convert the exponent field to its decimal representation. 


The exponent field is a 2s-complement number. To convert a 2s- 
complement number, look at the MSB. If it is 0, then convert the 
binary number to a decimal number. If the MSB is 1, then comple- 
ment the binary number, add 1 to the result, and then convert this 
binary number to a decimal number. 


Step 2: Convert the mantissa field to its decimal representation. 


The mantissa field is represented as a sign-mantissa number with an 
implied 1 and an implied binary point between the sign bit and the frac- 
tion field. If the sign bit is cleared (s = 0), form the mantissa by writing 
01, and appending the bits in the fraction field after the binary point. 
For example, if f= 101000000009, then man = 01.101000000002: 


Fraction 


Ss 
EREZEIRAEAESES ESSERE 


Rewrite the mantissa as: 


Mantissa 


ERED EARS ESESESE eee 


If the sign bit is set (s = 1), form the mantissa by writing 10 and appending the 
bits in the fraction field after the binary point. For example, if f= 101000000005, 
then man = 10.10100000000>. 


s Fraction 


[ee ee eee eee ee 
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Rewrite the mantissa as: 


Mantissa 


EBRD FS ERES ER EE EEE ES 


Step 3: 


Shift the decimal point of the mantissa according to the value of the 
exponent. 


If the exponent is positive, shift the binary point to the right by the value 
of the exponent. If the exponent is negative, shift the binary point to 
the left. 


For example, if e = 249 and the man = 01.11000000000z, then the 
shifted mantissa becomes 0111.000000000», which is equivalent to 
7 in decimal. 


If, on the other hand, e = -249 and man = 01.10000000000z, then the 
shifted mantissa becomes 0.0110000000000p, which is equivalent 
to 3/8 in decimal. 


The following examples illustrate how you can obtain the equivalent 
floating-point value of a number in ‘C3x floating-point format. Each 
of the examples uses the single-precision floating-point format. 


Example 5-1. Positive Number 
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0 2 4 0 0 0 0 0 Hex value 
0000 0010 0100 0000 0000 0000 0000 0000 Binary value 
Exponent = 0000 00105 = 2 
Sign = 0 
Fraction = -100005 
Value = 01.15 x 22 = 01105. = 6 

——$ 2 Fraction 
Implied 


Sign 


Example 5-2. Negative Number 


Floating-Point Formats 


0 1 Cc 0 0 0 0 0 
0000 0001 1100 0000 0000 0000 0000 0000 


Hex value 
Binary value 


Example 5-3. Fractional Number 


Exponent = 0000 00015 = 1 
Sign = i 
Fraction = 100005 
Value = 10.15 x 21 = 1015. = -3 
= Fraction 
Implied 
Sign 
F B 4 0 0 0 0 0 Hex value 


1111 1011 0100 0000 0000 0000 0000 0000 


Exponent = 1111 10115 = -5 

Sign = 0 2-5 
Fraction = -100005 |-— 
Value = 01.1, x 2-5 = .0000115 = 3/64 


a Fraction 


Implied 
Sign 


Binary value 
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5.3.6 Conversion Between Floating-Point Formats 


Floating-point operations assume several different formats for inputs and out- 
puts. These formats often require conversion from one floating-point format to 
another (for example, short floating-point format to extended-precision floating- 
point format). Format conversions occur automatically in hardware, with no 
overhead, as a part of the floating-point operations. Examples of the four con- 
versions are shown in Figure 5-10 through Figure 5-13. When a floating-point 
format 0 is converted to a greater-precision format, it is always converted to a 
valid representation of 0 in that format. In Figure 5-10 through Figure 5-13, 
S = Sign bit of the exponent, y = short mantissa, and x = short exponent. 


Figure 5-10. Converting from Short Floating-Point Format to Single-Precision 
Floating-Point Format 


Short floating-point format 
31 27 24 23 22 12 11 0 


COrreerrs Ol CC 


Single-precision floating-point format 


In this format, the exponent field is sign extended, and the 12 LSBs of the mantissa 
field are filled with Os. 


Figure 5-11. Converting from Short Floating-Point Format to Extended-Precision 
Floating-Point Format 
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39 35 32 31 30 20 19 0 


EXEeerery Fl EA 


Extended-precision floating-point format 


The exponent field in this format is sign extended, and the 20 LSBs of the mantissa 
field are filled with Os. 


Floating-Point Formats 


Figure 5-12. Converting from Single-Precision Floating-Point Format 
to Extended-Precision Floating-Point Format 


31 24 23 22 0 


Extended-precision floating-point format 


The 8 LSBs of the mantissa field are filled with Os. 


Figure 5-13. Converting from Extended-Precision Floating-Point Format 
to Single-Precision Floating-Point Format 


39 32 31 30 8 7 0 


Extended-precision floating-point format 


31 24 23 22 


Single-precision floating-point format 


The 8 LSBs of the mantissa field are truncated. 


ao 
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5.4 Floating-Point Conversion (IEEE Std. 754) 


The ‘C8x floating-point format is not compatible with the IEEE standard 754 
format. The IEEE floating-point format uses sign-magnitude notation for the 
mantissa, and the exponent is biased by 127. In a 32-bit word representing a 
floating-point number, the first bit is the sign bit. The next eight bits correspond 
to the exponent, which is expressed in an offset-by-127 format (the actual expo- 
nent is e -127). The next 23 bits represent the absolute value of the mantissa 
with the most significant 1 implied. The binary point follows this most significant 
1. In other words, the mantissa actually has 24 bits (see Figure 5-14). There are 
several special cases, summarized below. 


These are the values of the represented numbers in the IEEE floating-point 
format: 


X = (-1)§ x 2€-127 x (01.4) ifO<e< 255 


Figure 5-14. IEEE Single-Precision Std. 754 Floating-Point Format 
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31 30 23 | 22 0 
eS Se ees 
lq mantissa >| 


The following five cases define the value v of a number expressed in the IEEE 
format: 


1) If e=255 and f#0, then v= NaN 
2) if e=255 and f=0, then v = (-1)s infinite 
3) If O<e<255, then v= (-1)§x 26-127(1.f) 
4) If e=0 and f+0, then v= (-1)§ x 2-126(0.f ) 
5) If e=0 and f=0, then v=(-1)§x0 
where: 
s= sign bit 


e = the exponent field 

f = the fraction field 

NaN = not a number 
For the above five representations, e is treated as an unsigned integer. Case 
1 generates NaN (not an number) and is primarily used for software signaling. 


Case 4 represents a denormalized number. Case 5 represents positive and 
negative 0. 


Floating-Point Conversion (IEEE Std. 754) 


Figure 5-15. TMS320C3x Single-Precision 2s-Complement Floating-Point Format 
31 24 2322 0 


[ee 
Note: Same format as for the 'C4x 


In comparison, Figure 5-15 shows the the ‘C3x 2s-complement floating-point 
format. In this format, two cases can be used to define value v of a number: 


1) If e=-128 then v=0 
2) If e#-128 then v= ss.fo x 2 
where: 
s = sign bit 


e = the exponent field 
f = the fraction field 


For this representation, e is treated as a 2s-complement integer. The fraction 
and sign bit form a normalized 2s-complement mantissa. 


Note: Differentiating Symbols for IEEE and TMS320C3x Formats 


To differentiate between the symbols that define these two formats, all IEEE 
fields are subscripted with an IEEE (for example, ejEEE, S|EEE, and so forth). 
Similarly, all 2s-complement fields are subscripted with 2 (that is, eo, So, fo). 


5.4.1 Converting IEEE Format to 2s-Complement TMS320C3x Floating-Point Format 


The most common conversion is the IEEE-to-2s-complement format. This 
conversion is done according to rules in Table 5-1. 


Table 5-1. Converting IEEE Format to 2s-Complement Floating-Point Format 


If these values are present Then these values equal 
Description Case ejFEE SIEEE fiEEE eo So fo 
max neg oo 1 255 1 any 7Fh 1 00 0000h 
max POS oo 2 255 0 any 7Fh 0 7F FREFFh 
3 O<eeee <255 0 flEEE 9 eiEEE-7Fh 0 flEEE 
4 O<epee <255 1 #0 elEEE-7Fh 1 figee+ it 
5 O<epee<255 1 0 eiEEE-80h 1 0 
zero 6 0 any any 80h 0 00 0000h 


t # EEE = 1s complement of f/EEE 
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Case 1 maps the IEEE positive NaNs and positive infinity to the single-preci- 
sion 2s-complement most positive number. Overflow is also signaled to allow 
you to check for these special cases. 


Case 2 maps the IEEE negative NaNs and negative infinity to the single- 
precision 2s-complement most negative number. Overflow is also signaled 
to allow you to check for these special cases. 


Case 3 maps the IEEE positive normalized numbers to the identical value in 
the 2s-complement positive number. 


Case 4 maps the IEEE negative normalized numbers with a nonzero fraction 
to the identical value in the 2s-complement negative number. 


Case 5 maps the IEEE negative normalized numbers with a 0 fraction to the 
identical value in the 2s-complement negative number. 


Case 6 maps the IEEE positive and negative denormalized numbers and positive 
and negative Os to a 2s-complement 0. 


Based on these definitions of the formats, two versions of the conversion routines 
were developed. One version handles the complete definition of the formats. The 
other ignores some of the special cases (typically the ones that are rarely used), 
but it has the benefit of executing faster than the complete conversion. For this 
discussion, the two versions are referred to as the complete version and the fast 
version, respectively. 


Floating-Point Conversion (IEEE Std. 754) 


5.4.1.1 IEEE-to-TMS320C3x Floating-Point Format Conversion 


Example 5—4 shows the fast conversion from IEEE to ’C3x floating-point format. 
It properly handles the general case when 0 < e < 255, and also handles Os (that 
is, @= 0 and f= 0). The other special cases (denormalized, infinity, and NaN) 
are not treated and, if present, will give erroneous results. 


The fast version of the IEEE-to ’C3x conversion routine was originally developed 
by Keith Henry of Apollo Computer, Inc. The other routines were based on this 
initial input. 


Example 5-4. IEEE-to-TMS320C3x Conversion (Fast Version) 


t+ FF FF HF HF HF HF HF HF 


t+ Ft FF HF 


+ FF 


TITLE IEEE TO TMS320C3x CONVERSION (FAST VERSION) 


SUBROUTINE FMIEEE 


FUNCTION: CONVERSION BETWEEN THE IEEE FORMAT AND TH 
TMS320C3x FLOATING-POINT FORMAT. THE NUMBER TO 

BE CONVERTED IS IN THE LOWER 32 BITS OF RO. 

THE RESULT IS STORED IN THE UPPER 32 BITS OF RO. 
UPON ENTERING THE ROUTINE, AR1 POINTS TO THE 
FOLLOWING TABLE: 


G 


(0) OXFF800000 <—- AR1 
(1) OxFFO00000 
(2) 0x7F000000 
(3) 0x80000000 
(4) 0x81000000 


ARGUMENT ASSIGNMENTS: 


ARGUMENT | FUNCTION 

Se oe ar an Fe a ea Re Se 
RO | NUMBER TO BE CONVERTED 

AR1 | POINTER TO TABLE WITH CONSTANTS 


REGISTERS USED AS INPUT: RO, ARI 
REGISTERS MODIFIED: RO, R1 
REGISTER CONTAINING RESULT: RO 
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Example 5-4. |EEE-to-TMS320C3x Conversion (Fast Version) (Continued) 


+ 


FMI 


NEG 


NOTE: SINCE THE STACK POINTER SP IS USED, MAKE SURE TO 


CYCLES: 12 (WORST CASE) WORDS: 12 


INITIALIZE IT IN THE CALLING PROGRAM. 


-global FMIEEE 


EEE AND3 RO, *AR1,R1 ; Replace fraction with 0 
BND NEG ; Test sign 
ADDI RO,R1 ; Shift sign 
; and exponent inserting 0 
LDIZ *4+AR1(1),R1 ; If all 0, generate C30 0 
SUBI *4+AR1(2),R1 ; Unbias exponent 
PUSH R1 
POPF RO ; Load this as a flt. pt. number 
RETS 
PUSH R1 
POPF RO ; Load this as a flt. pt. number 
NEGF RO,RO ; Negate if orig. sign is negative 
RETS 
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Example 5-5 shows the complete conversion between IEEE and ’C3x formats. 
In addition to the general case and the Os, it handles the special cases as follows: 


Lj If NaN (e = 255, f< >0), the number is returned intact. 


1] Ifinfinity (e = 255, f = 0), the output is saturated to the most positive or nega- 
tive number, respectively. 


(1 If denormalized (e = 0, f< >0), two cases are considered. If the MSB of f is 
1, the number is converted to ’C3x format. Otherwise an underflow occurs, 
and the number is set to 0. 


Floating-Point Conversion (IEEE Std. 754) 


Example 5-5. IEEE-to-TMS320C3x Conversion (Complete Version) 


* TITLE IEEE TO TMS320C3x CONVERSION (COMPLETE VERSION) 
* 
* 
* SUBROUTINE FMIEEER1 
* 
* FUNCTION: CONVERSION BETWEEN THE IEEE FORMAT AND THE TMS320C3x 
* FLOATING-POINT FORMAT. THE NUMBER TO BE CONVERTED 
* IS IN THE LOWER 32 BITS OF RO. THE RESULT IS STORED 
* IN THE UPPER 32 BITS OF RO. 
* 
* 
* UPON ENTERING THE ROUTINE, AR1 POINTS TO THE FOLLOWING TABLE: 
* 
* 0) OxFF800000 <-- ARI 
* 1) OxFF000000 
* 2) 0x7F000000 
* 3) 0x80000000 
* 4) 0x81000000 
* 5) 0x7F800000 
* 6) 0x00400000 
* 7) OXOO7FFFFF 
* 8) OX7F7FFFFF 
* 
* ARGUMENT ASSIGNMENTS: 
* ARGUMENT | FUNCTION 
* a a a TE 5 aN a NN SE TEE 
* RO | NUMBER TO BE CONVERTED 
* ARI | POINTER TO TABLE WITH CONSTANTS 
* 
* REGISTERS USED AS INPUT: RO, AR1 
* REGISTERS MODIFIED: RO, R1 
* REGISTER CONTAINING RESULT: RO 
* 
* NOTE: SINCE THE STACK POINTER SP IS USED, MAKE SURE TO 
* INITIALIZE IT IN THE CALLING PROGRAM. 
* 
* 
* CYCLES: 23 (WORST CASE) WORDS: 34 
* 
-global FMIEEE1 
* 
FMIEEEL LDI RO,R1 
AND *4+AR1(5),R1 
BZ UNNORM Fi If e = 0, number is either 0 or 
* ; denormalized 
XOR *4+AR1(5),R1 
BNZ NORMAL i If e < 255, use regular routine 
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Example 5-5. IEEE-to-TMS320C3x Conversion (Complete Version) (Continued) 


* HANDLE NaN AND INFINITY 


TSTB *+AR1(7),RO 
RETSNZ 
LD] RO,RO 


LDFGT *+AR1(8),RO 


LDFN *+AR1(5),RO 


UNNORM TSTB *+AR1(6),RO 
LDFZ *+AR1(3),RO 
RETSZ 
XOR *+AR1(6),RO 
BND NEG1 
LSH 1,R0 
SUBI *+AR1(2),RO 
PUSH RO 
POPF RO 
RETS 
NEG1 POPF RO 
NEGF RO,RO 
RETS 
= HANDLE THE REGULAR CASES 


BND NEG 
ADDI RO,R1 


PUSH R1 
POPF RO 
RETS 

NEG POPF RO 
NEGF RO, RO 
RETS 


= HANDLE Os AND UNNORMALIZED NUMBERS 


NORMAL AND3 RO, *AR1,R1 


SUBI *+AR1(2),R1 


Return if NaN 


If positive, infinity = 
most positive number 


If negative, infinity = 
most negative number RETS 


Is the MSB of £ equal to 1? 
T£ not, force the number to 0 
and return 


If MSB of f = 1, make it 0 
Eliminate sign bit 
& line up mantissa 


Make e = -127 


Put number in floating point format 


If negative, negate RO 


Replace fraction with 0 

Test sign 

Shift sign and exponent inserting 0 
Unbias exponent 


Load this as a flt. pt. number 


Load this as a flt. pt. number 
Negate if original sign negative 
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5.4.2 Converting 2s-Complement TMS320C3x Floating-Point Format to IEEE Format 


Table 5-2. Converting 2s-Complement Floating-Point Format to IEEE Format 


Case 
1 


2 
3 
4 
5 
6 


This conversion is performed according to the following table: 


If these values are present 


Then these values equal 


e2 SQ fp ClEEE SIEEE flEEE 
-128 00h 0 00 0000h 
-127 00h 0 00 0000h 
~126< @9 <127 0 @5+7Fh 0 fo 

~126< ey <127 1 #0 @o+7Fh 0 Fovtt 
~126< @5 <127 1 0 @5+80h 1 00 0000h 
127 1 0 FFh 1 00 0000h 


t f =2s-complement of fo. 


Case 1 maps a 2s-complement 0 to a positive IEEE 0. 


Case 2 maps the 2s-complement numbers that are too small to be repre- 
sented as normalized IEEE numbers to a positive IEEE 0. 


Case 3 maps the positive 2s-complement numbers that are not covered by 
case 2 into the identically valued IEEE number. 


Case 4 maps the negative 2s-complement numbers with a nonzero fraction 
that are not covered in case 2 into the identically valued IEEE number. 


Case 5 maps all the negative 2s-complement numbers with a 0 fraction, except 
for the most negative 2s-complement number and those that are not covered 
in case 2, into the identically valued IEEE number. 


Case 6 maps the most negative 2s-complement number to the IEEE negative 
infinity. 
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5.4.2.1. TMS320C3x-to-IEEE Floating-Point Format Conversion 


The vast majority of the numbers represented by the ’C3x floating-point format 
are covered by the general IEEE format and the representation of Os. The only 
special case is e = -127 in the ’C3x format; this corresponds to a denormalized 
number in IEEE format. It is ignored in the fast version, while it is treated properly 
in the complete version. Example 5-6 shows the fast version, and Example 5-7 
shows the complete version of the ’C3x-to-IEEE conversion. 


Example 5-6. TMS320C3x-to-IEEE Conversion (Fast Version) 


+ F FF HF HF HF HF HF HF 


t+ FF HF FF HF 


+ FF 


+ FF OF 


TITLE TMS320C3x TO IEEE CONVERSION (FAST VERSION) 


SUBROUTINE TOIEEE 


FUNCTION: CONVERSION BETWEEN THE TMS320C3x FORMAT AND THE IEEE 
FLOATING-POINT FORMAT. THE NUMBER TO BE CONVERTED 

IS IN THE UPPER 32 BITS OF RO. THE RESULT WILL BE IN 

THE LOWER 32 BITS OF RO. 


UPON ENTERING THE ROUTINE, AR1 POINTS TO THE FOLLOWING TABLE: 


(0) OXFF800000 <-—- AR1 
(1) OxFFO000000 
(2) 0x7F000000 
(3) 0x80000000 
(4) 0x81000000 


ARGUMENT ASSIGNMENTS: 


ARGUMENT | FUNCTION 

Stam ees pss ee ee a ee ee ee, 
RO | NUMBER TO BE CONVERTED 
AR1 | POINTER TO TABLE WITH CONSTANTS 


REGISTERS USED AS INPUT: RO, AR1 
REGISTERS MODIFIED: RO 
REGISTER CONTAINING RESULT: RO 


NOTE: SINCE THE STACK POINTER ‘SP’ IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 


5-22 


Floating-Point Conversion (IEEE Std. 754) 


Example 5-6. TMS320C3x-to-IEEE Conversion (Fast Version) (Continued) 


* CYCLES: 14 (WORST CASE) WORDS: 15 
* 
-global TOIEEE 
* 
TOLEEE iDF RO,RO ; Determine the sign of the number 
LDFZ *+AR1 (4) ,RO If 0, load appropriate number 
BND NEG ; Branch to NEG if negative (delayed) 
ABSF RO 7 Take the absolute value of the number 
LSH 1,RO0 7 Eliminate the sign bit in RO 
PUSHF RO 
POP RO ; Place number in lower 32 bits of RO 
ADDI *4+AR1 (2) ,RO - Add exponent bias (127) 
LSH —1,R0 ; Add the positive sign 
RETS 
NEG POP RO j Place number in lower 32 bits 
; of RO 
ADDI *4+AR1 (2) ,RO : Add exponent bias (127) 
LSH =1R0 ; Make space for the sign 
ADDI *4+AR1(3),RO ; Add the negative sign 
RETS 
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Example 5-7. TMS320C3x-to-IEEE Conversion (Complete Version) 


+ OF FF HHH 


* 


TITLE TMS320C3x TO IEEE CONVERSION (COMPLETE VERSION) 


SUBROUTINE TOIEEE1 


FUNCTION: CONVERSION BETWEEN THE TMS320C3x FORMAT AND THE IEEE 
FLOATING-POINT FORMAT. THE NUMBER TO BE CONVERTED 

IS IN THE UPPER 32 BITS OF RO. THE RESULT WILL BE 

IN THE LOWER 32 BITS OF RO. 


UPON ENTERING THE ROUTINE, AR1 POINTS TO THE FOLLOWING TABLE: 


OxFF800000 <—— AR1 
OxFFO00000 
0x7FO000000 
0x80000000 
0x81000000 
0x7F800000 
0x00400000 
OxOO7FFFFF 
OxX7F7FFFFF 


OaNAHUPWNEHEO 


ARGUMENT ASSIGNMENTS: 


ARGUMENT | FUNCTION 

ae era eee nee as ee 
RO | NUMBER TO BE CONVERTED 

AR1 | POINTER TO TABLE WITH CONSTANTS 


REGISTERS USED AS INPUT: RO, AR1 
REGISTERS MODIFIED: RO 
REGISTER CONTAINING RESULT: RO 


NOTE: SINCE THE STACK POINTER ‘SP’ IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 


CYCLES: 31 (WORST CASE) WORDS: 25 


-global TOIEEE1 
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Example 5-7. TMS320C3x-to-IEEE Conversion (Complete Version) (Continued) 


TOIEEI 


iDF 


CONT 


NEG 


LDFZ 
BND 
ABSF 


LSH 
PUSHF 
POP 
ADDI 
LSH 


TSTB 
RETSNZ 
TSTB 
RETSZ 
PUSH 
POPF 
LSH 
PUSHF 
POP 
ADDI 
RETS 


POP 
BRD 
ADDI 
LSH 
ADDI 
RETS 


RO, RO 
*+AR1 (4) ,RO 
NEG 

RO 


1,R0 

RO 

RO 
*+AR1 (2) ,RO 
—1,R0 


*+AR1(5),RO 
*+AR1(7) ,RO 


RO 

RO 

—1,R0 

RO 

RO 
*+AR1 (6) ,RO 


RO 

CONT 
*+ARI (2) ,RO 
—1,R0 
*+AR1 (3) ,RO 


Determine the sign of the number 
If 0, load appropriate number 
Branch to NEG if negative (delayed) 
Take the absolute value 

of the number 
Eliminate the sign bit in RO 


Place number in lower 32 bits of RO 


Add exponent bias (127) 
Add the positive sign 


If e > 0, return 


Ife =0 & f = 0, return 


Shift f right by one bit 


Add 1 to the MSB of f 


Place number in lower 32 bits of RO 


Add exponent bias (127) 
Make space for the sign 
Add the negative sign 
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5.5 Floating-Point Multiplication 
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A floating-point number a can be written in floating-point format as in the following 
formula, where o(man) is the mantissa and «(exp) is the exponent: 


a = o(man) x 20( exp) 
The product of « and bis c, defined as: 


Cc = «x b=a(man) x b(man) x 2(0(exp) + (exp) 


thus: 
c(man) = o(man) x b(man) 
c(exp) = a(exp) + b(exp) 


During floating-point multiplication, source operands are in the single-precision 
floating-point format. If the source operands are in short floating-point format, 
they are converted to single-precision floating-point format. If the source oper- 
ands are in extended-precision floating-point format, they are truncated to 
single-precision format. These conversions occur automatically in hardware 
with no overhead. All results of floating-point multiplications are in the extended- 
precision format. These multiplications occur in a single cycle. 


Figure 5—16 is a flowchart that shows the steps involved in floating-point multi- 
plication. Each step is labelled with a number in parenthesis. 


11 In step 1, the 24-bit source operand mantissas are multiplied, producing 
a 50-bit result c(man). (Input and output data are always represented as 
normalized numbers.) 


_j In step 2, the exponents, «(exp) and b(exp), are added, yielding c(exp). 


[1 Step 3 checks for whether c(man) in extended-precision format is equal to 
0. If c(man) is 0, step 7 sets c(exp) to -128, thus yielding the representation 
for 0. 


Lj Steps 4 and 5 normalize the result. 


Li If aright shift of 1 is necessary, then in step 8, c(man) is right-shifted one 
bit, thus adding 1 to c(exp). 


Li If aright shift of 2 is necessary, then in step 9, c(man) is right-shifted two 
bits, thus adding 2 to c(exp). Step 6 occurs when the result is normalized. 


Lj Instep 10, c(man) is set in the extended-precision floating-point format. 


_} Steps 11 through 16 check for special cases of c(exp). 


| 


Floating-Point Multiplication 


If c(exp) has overflowed (step 11) in the positive direction, then step 14 
sets c(exp) to the most positive extended-precision format value. If c(exp) 
has overflowed in the negative direction, then step 14 sets c(exp) to the 
most negative extended-precision format value. 


If c(exp) has underflowed (step 12), then step 15 sets c to 0; that is, 
c(man) = 0 and c(exp) = -128. 
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Floating-Point Multiplication 


Figure 5-16. Flowchart for Floating-Point Multiplication 


a.(man) b(man) a.(exp) b(exp) 
(1) (2) 
c(man) = a(man) x b(man) c(exp) = a(exp) + b(exp) 


(50-bit result) 


Test for special cases of c(man) 


(4) (5) (6) 
Right shift 1 Right shift 2 No shift 
to normalize to normalize to normalize 


c(man) > > 1 c(man) >> 2 
and and 
c(exp) = c(exp) + 1 c(exp) = c(exp) + 2 


Dispose of extra bits (10) 


Put c(man) in extended pre- 
cision floating-point format 


Test for special cases of c(exp) 


(11) (12) (13) 
c(exp) overflow c(exp) underflow c(exp) in range 


If c(man) > 0, 
set c(exp) to most 
positive value 


If cman) < 0, 
set c(exp) to most 
negative value 


Set c to final result (16) 


Vv 
c=axb 
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Example 5-8 through Example 5-12 illustrate how floating-point multiplication 
is performed on the ’C3x. For these examples, the implied most significant 
nonsign bit is made explicit. 


Example 5-8. Floating-Point Multiply (Both Mantissas = —2.0) 


Let: 


o = -2.0 x 24(€XxP) = 10.00000000000000000000000 x 2%(€xP) 
b = -2.0 x 24(€xP) = 10.00000000000000000000000 x 2exP) 


Where: 


a and b are both represented in binary form according to the normalized 
single-precision floating-point format. 


Then: 


10.00000000000000000000000 x 20(exP) 
x 10.00000000000000000000000 x 2 exp) 


0100.000000000000000000000000000000000000000000000 x 2 («(exP) + b(exp)) 


To place this number in the proper normalized format, it is necessary to shift 
the mantissa two places to the right and add 2 to the exponent. This yields: 


10.00000000000000000000000 x 20(exP) 
x 10.00000000000000000000000 x 24 exp) 


0100.0000000000000000000000000000000000000000000000 x 2 («(exp) + Hexp)) 


In floating-point multiplication, the exponent of the result may overflow. This can 
occur when the exponents are initially added or when the exponent is modified 
during normalization. 
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Example 5-9. Floating-Point Multiply (Both Mantissas = 1.5) 


Let: 


a = 1.5 x 2€x) = 01.0000000000000000000000 x 2%(exP) 
b = 1.5 x 24€x0) = 01.0000000000000000000000 x 2%exP) 


Where: 


aand bare both represented in binary form according to the single-preci- 
sion floating-point format. 


Then: 


10.00000000000000000000000 x 2%(¢xP) 
x 10.00000000000000000000000 x 24% exp) 


01.0000000000000000000000000000000000000000000000 x 2 (*exe) + Kexp) + 2) 
To place this number in the proper normalized format, it is necessary to shift 
the mantissa one place to the right and add 1 to the exponent. This yields: 


01.0000000000000000000000 x 2%?) 
x 01.0000000000000000000000 x 24%exP) 


01. 00100000000000000000000000000000000000000000000 x 2 (*exA) + exp) + 1) 


Example 5-10. Floating-Point Multiply (Both Mantissas = 1.0) 


Let: 


a = 1.0 x 2%xP) = 01.00000000000000000000000 x 2%( xP) 
b = 1.0 x 24x) = 01.00000000000000000000000 x 24% exP) 


Where: 


a and bare both represented in binary form according to the single-preci- 
sion floating-point format. 


Then: 


01.00000000000000000000000 x 2%(xP) 
x 01.00000000000000000000000 x 24%exP) 


0001 .0000000000000000000000000000000000000000000000 y 2 (dexP) + exp) 


This number is in the proper normalized format. Therefore, no shift of the 
mantissa or modification of the exponent is necessary. 


The previous examples show cases where the product of two normalized 
numbers can be normalized with a shift of 0, 1, or 2. The floating-point format 
of the ‘C3x makes this possible. 
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Example 5-11. Floating-Point Multiply Between Positive and Negative Numbers 


Let: 


ao = 1.0 x 2%€x) = 01.00000000000000000000000 x 2%(exP) 
b = -2.0 x 24x) = 10.00000000000000000000000 x 2% xP) 


Then: 


01.00000000000000000000000 x 2%exe) 
x 10.00000000000000000000000 x 2% xP) 


1110.0000000000000000000000000000000000000000000000 x 2 (exp) + exp) 


The resultis: c= —2.0 x 2(a(exp) + bexp)) 
Example 5-12. Floating-Point Multiply by 0 


All multiplications by a floating-point 0 yield a result of 0 (f= 0, s = 0, and 
exp = -128). 
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5.6 Floating-Point Addition and Subtraction 
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In floating-point addition and subtraction, two floating-point numbers o and b 
can be defined as: 


a = o(man) x 2 &(exp) 
b = b(man) x 2 exp) 
The sum (or difference) of « and b can be defined as: 


c=atb 
= (a(man) + (b(man) x 2 -(a( exp) - H(exp))) x 2 &( EXP), if a(exp) > b(exp) 
= (a(man) x 2 -(Hexp)-a(exp))) + b(man)) x 2 &XP), if (exp) < b(exp) 


Figure 5-17 shows the flowchart for floating-point addition. Because this flow- 
chart assumes signed data, it is also appropriate for floating-point subtraction. 
In this figure, it is assumed that a(exp) < b(exp). 


Lj In step 1, the source exponents, «(exp) and b(exp), are compared, and 
c(exp) is set equal to the largest of the two source exponents. 


_j In step 2, dis set to the difference of the two exponents. 


Lj In step 3, the mantissa with the smallest exponent, in this case a(man), 
is right-shifted d bits to align the mantissas. 


Lj In step 4, after the mantissas have been aligned, they are added. 


_j In steps 5 through 7, a check for a special case of c(man). If c(man) is 0 
(step 5), then c(exp) is set to its most negative value (step 8) to yield the 
correct representation of 0. If c(man) has overflowed c (step 6), then in 
step 9 c(man) is right-shifted one bit and 1 is added to c(exp). In step 10, 
the result is normalized. 


Lj) Steps 11 through 13 check for special cases of c(exp). If c(exp) has over- 
flowed (step 11) in the positive direction, then step 14 sets c(exp) to the 
most positive extended-precision format value. If c(exp) has overflowed 
(step 11) in the negative direction, then step 14 sets c(exp) to the most 
negative extended-precision format value. If c(exp) has underflowed (step 
12), then step 15 sets cto 0; that is, c(man) = 0 and c(exp) = -128. If no 
overflow or underflow occurred, then cis not modified. 


Floating-Point Addition and Subtraction 


Figure 5-17. Flowchart for Floating-Point Addition 


a.(man) b(man) a.(exp) b(exp) 
(1) 
If a(exp) < = b(exp) 
c(exp) = b(exp) 
(3) else 


c(exp) = (exp) 


a(man) = (man) > > d 


(Assume for simplicity 
that (exp) < = b(exp)) 


Discard LSBs to keep Fi 
o(man) in extended- 
precision floating- (2) Subtract exponents 


point format d= b(exp) — a(exp) 


(a 
c(man) = (man) + b(man) 


Test for special cases of c(man) 
(6) (7) 


5 
” k = # of leading 
ce(man) = 0 Overflow of c(man) non-significant 


sign bits 


c(man) = c(man) > > 1 
c(exp) = c(exp) + 1 
Discard LSBs to keep in 
extended-precision 
floating-point format 


c(exp) 


Test for special cases of c(exp) 


(11) (12) (13) 
c(exp) overflow c(exp) underflow c(exp) in range 


If c(man) > 0, set cto 0 
set cto most c(exp) = -128 
positive value 

c(man) = 0 


If c(man) < 0, 
set cto most 
negative value 


Set c to final result 


c=a+b 
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The following examples describe the floating-point addition and subtraction 
operations. It is assumed that the data is in the extended-precision floating- 
point format. 


Example 5-13. Floating-Point Addition 


In the case of two normalized numbers to be summed, let 

a = 1.5 = 01.1000000000000000000000000000000 x 29 

b = 0.5 = 01.0000000000000000000000000000000 x 2-1 
It is necessary to shift b to the right by 1 so that « and b have the same exponent. 
This yields: 

b = 0.5 = 00.1000000000000000000000000000000 x 2° 


Then: 


01.10000000000000000000000000000000 x2 
+00.10000000000000000000000000000000 x 20 


010.00000000000000000000000000000000 x 29 


As in the case of multiplication, it is necessary to shift the binary point one place 
to the left and add 1 to the exponent. This yields: 


01.1000000000000000000000000000000 x 2° 
+ 00.1000000000000000000000000000000 x 29 


01.0000000000000000000000000000000 x 21 
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Example 5-14. Floating-Point Subtraction 


A subtraction is performed in this example. Let: 
a. = 01.0000000000000000000000000000001 x 2° 
b = 01.0000000000000000000000000000000 x 2° 


The operation performed is ~—b. The mantissas are already aligned because 
the two numbers have the same exponent. The result is a large cancellation 
of the upper bits, as shown below. 


01.0000000000000000000000000000001 x 20 
—01.0000000000000000000000000000000 x 20 


00.0000000000000000000000000000001 x 20 


The result must be normalized. In this case, a left shift of 31 is required. The 
exponent of the result is modified accordingly. The result is: 


01.0000000000000000000000000000001 x 20 
— 01.0000000000000000000000000000000 x 2° 


01.0000000000000000000000000000000 x 2-31 


Example 5-15. Floating-Point Addition With a 32-Bit Shift 


This example illustrates a situation where a full 32-bit shift is necessary to 
normalize the result. Let: 


oe = 01.14411411419111111114111111111111 x 2127 
b = 10.0000000000000000000000000000000 x 2127 
The operation to be performed is a + b. 


O4.11411449144994111114111111111111 x 2127 
+ 10.0000000000000000000000000000000 x 2127 


414.4944994994114411111111111111111 x 2127 


Normalizing the result requires a left shift of 32 and a subtraction of 32 from 
the exponent. The result is: 


O4.41141144941991111111111111111111 x 2127 
+ 10.0000000000000000000000000000000 x 2127 


44.49449449941144911411111111111111 x 2127 
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Example 5-16. Floating-Point Addition/Subtraction With Floating-Point 0 


When floating-point addition and subtraction are performed with a floating- 
point 0, the following identities are satisfied: 


a+0 = a(a#0) 
0+0=0 
0 -a= -a(a#0) 
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5.7 Normalization Using the NORM Instruction 


The NORM instruction normalizes an extended-precision floating-point number 
that is assumed to be unnormalized (see Example 5-17). Since the number is 
assumed to be unnormalized, no implied most significant nonsign bit is assumed. 
The NOR\M instruction: 


1) Locates the most significant nonsign bit of the floating-point number 
2) Left shifts to normalize the number 
3) Adjusts the exponent 


Example 5-17. NORM Instruction 


Assume that an extended-precision register contains the value: 
man = 00000000000000000001000000000001, exp = 0 


When the normalization is performed on a number assumed to be unnormalized, 
the binary point is assumed to be: 


man = 0.0000000000000000001000000000001, exp = 0 


This number is then sign-extended one bit so that the mantissa contains 33 bits: 


man = 00.0000000000000000001000000000001, exp = 0 


The intermediate result after the most significant nonsign bit is located and the 
shift performed is: 


man = 01.0000000000010000000000000000000, exp = -19 
The final 32-bit value output after removing the redundant bit is: 


man = 00000000000010000000000000000000, exp = -19 


The NORM instruction is useful for counting the number of leading Os or leading 
1s in a 32-bit field. If the exponent is initially 0, the absolute value of the final 
value of the exponent is the number of leading 1s or Os. This instruction is also 
useful for manipulating unnormalized floating-point numbers. 


Given the extended-precision floating-point value a to be normalized, the 
normalization, norm ( ), is performed as shown in Figure 5-18. 
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Figure 5-18. Flowchart for NORM Instruction Operation 


a 


Test for special cases of c(man 


(man) 
(1) (2) 
a (man) =0 Leading nonsignificant sign bits 
k = # of leading 


nonsignificant 
sign bits 


c(exp) = -128 . . 

Sign-extended a(man) 1 bit 
c(man) = a(man) <<k 
c(exp) = a(exp) -k 


Remove most significant nonsign bit 
Test for special cases of c(exp) 


(6) (7) 
c(exp) c(exp) in 
underflow range 


c(exp) = -128 
No change to c(man) 


(9) Set c to final result 
Vv 
c = norm(«) 
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5.8 Rounding (RND Instruction) 


The RND instruction rounds a number from the extended-precision floating- 
point format to the single-precision floating-point format. Rounding is similar to 
floating-point addition. Given the number a to be rounded, the following opera- 
tion is performed first. 


© = oman) x 20(€XP) + (1 x 2oe(exp)-24) 


Next, a conversion from extended-precision floating-point to single-precision 
floating-point format is performed. Given the extended-precision floating-point 
value, the rounding, rnd( ), is performed as shown in Figure 5-19. 


NT 
Note: 


RND, src, dst—where (src) = 0O—does not set the 0 conditions flag (bit 2 in 
the status register). Instead, it sets the underflow condition flag (bit 4 in the 
status register). When required, check for the underflow condition instead of 


the 0 condition. 
—_—_—_———— eee 
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Rounding (RND Instruction) 


Figure 5-19. Flowchart for Floating-Point Rounding by the RND Instruction 


‘ 1X2 a(exp) —24 


Add o(man) and 1/2 of LSB 


c(man) = o( man) + 2-24 


Test for special cases of c(man) 
c(man) = 0 


Overflow of c(man) 


No special case 


c(man) = c(man) << 1 
c(exp) = a (exp) + 1 


Test for special cases of c(exp) 


c(exp) overflow 


c(exp) in range 


If c(man) > 0, 

set cto most positive 
single-precision value 
If c(man) < 0, 

set c to most negative 
single-precision value 


Set eight LSBs of c(man) to 0 


c= rnd(q) 


Floating-Point to Integer Conversion (FIX Instruction) 


5.9 Floating-Point to Integer Conversion (FIX Instruction) 


Using the FIX instruction, you can convert an extended-precision floating- 
point number to a single-precision integer in a single cycle. The floating-point 
to integer conversion of the value x is referred to here as fix(x). The conversion 
does not overflow if a, the number to be converted, is in the range: 


—231 <q < 231-1 
First, you must be certain that 
o(exp) < 30 


If these bounds are not met, an overflow occurs. If an overflow occurs in the 
positive direction, the output is the most positive integer. If an overflow occurs 
in the negative direction, the output is the most negative integer. If (exp) is 
within the valid range, then a(man), with implied bit included, is sign-extended 
and right-shifted (rs) by the amount 


rs = 31 — a(exp) 


This right shift (rs) shifts out those bits corresponding to the fractional part of 
the mantissa. For example: 


IfO <x <1, then fix(x) = 0 
If -1 <x <0, then fix(x) = -1 


Figure 5-20 shows the flowchart for the floating-point-to-integer conversion. 
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Floating-Point to Integer Conversion (FIX Instruction) 


Figure 5-20. Flowchart for Floating-Point to Integer Conversion by FIX Instruction 


a 


Test for special cases of a(exp) 


o(exp) in range 
rs = 31 - a(exp) 


If (man) > 0, 
c = most positive integer 


If a(man) < 0, 
c = most negative integer 


Set c to final result 


c= fix(a) 
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Integer to Floating-Point Conversion (FLOAT Instruction) 


5.10 Integer to Floating-Point Conversion (FLOAT Instruction) 
Integer to floating-point conversion, using the FLOAT instruction, allows 
single-precision integers to be converted to extended-precision floating-point 


numbers. The flowchart for this conversion is shown in Figure 5-21. 


Figure 5-21. Flowchart for Integer to Floating-Point Conversion by FLOAT Instruction 


a 


c(man) = a 
c(exp) = 30 


Test for special cases of c(man) 


Leading nonsignificant 


c(man) =0 sign bits 


k = # leading 
nonsignificant 
sign bits 


c(exp) = -128 c(man) = c(man) <<k 


c(exp) = 30 -k 


Remove most significant nonsign bit 


Set c to final result 


c = float (a) 
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5.11 Fast Logarithms on a Floating-Point Device 
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The following TMS320C30/C40 function calculates the log base two of a number 
in about half the time of conventional algorithms. Furthermore, the method can 
easily be scaled for faster execution if less accuracy is desired. The method is 
efficient because the algorithm uses the floating-point multipliers’ exponent/nor- 
malization hardware in a unique way. The following is a proof of the algorithm. 
The value of a floating point number X is given by: 


X = 2“EXP_old* mant_old 


Since the bit fields used to store the exponent and mantissa are actually inte- 
ger, the exponent is already in log2 (log base 2) form. In fact, the exponent is 
nothing more than a normalizing shift value. By converting both sides of the 
first equation to a logarithm, the logarithm of the value becomes the sum of the 
exponent and mantissa in log form: 


log2(X) = EXP_old + log2(mant_old) (Log base two) 


Since EXP is in the exponent register, no calculation is needed and the value 
can be used directly as an integer. To extract the value of the exponent, PUSH, 
POP, and masking operations are used. The remaining mantissa conversion 
is done by first forcing the exponent bits to zero using an LDE 1.0 instruction. 
This causes the exponent term 24EXP to equal 1.0, leaving 1.0 <= Value < 2.0. 
Then, by using the following identity, the logarithm of the mantissa can be ex- 
tracted from the final results exponent. If the value (mant_old) is repeatedly 
squared, the sequence becomes: 


X_new = mant_old*N 


where: 
1.0 X_new<24N 
N= 1,2,4,8,16... 


Since the hardware multiplier restructures the new value (X_new) during each 
squaring operation, X_new is represented by a new exponent (EXP_new) and 
mantissa (mant_new): 


X_new = 24EXP_New* mant_new 


By then applying familiar logarithm rules, we find that EXP_new holds the loga- 
rithm of Old_mant. This is best shown by setting the previous two equations 
equal to each other and taking the logarithm of both sides: 


mant_old*N = 2“EXP_new* mant_new 


N=1,2,4,8,16... 
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N* log2(mant_old) = EXP_new + log2(mant_new) 
log2(mant_old) = EXP_new/N + log2(mant_new)/N 


This last equation shows that the logarithm of mant_old is indeed related to 
EXP_new. And as shown earlier, EXP_new can be separated from the new 
mantissa and used as the logarithm of the original mantissa. 


We also need to consider the divisor N, which is defined to be the series 1, 2, 
4, 8, 16... , and EXP_new is an integer. The division by N becomes a shift for 
each squaring operation. What remains is to concatenate the bits of EXP_new 
to EXP_old and then repeat the process until the desired accuracy is achieved. 


5.11.1 Example of Fast Logarithm on a Floating-Point Device 


Consider a mantissa value of 1.5 and an exponent value of 0 (giving an expo- 
nent multiplier 2“0, or 1.0). The TMS320C30/C40 extended register bit pattern 
for the algorithm sequence is shown below. 


Table 5-3. Squaring Operation of FO = 1.5 


Exp 
00000000 
00000001 
00000010 
00000100 
00100100 
00010010 
00100101 
01001010 
XXXXXXXX 


“Ep 


nnC07 CO OD OG oO OlH 


Squaring Operation of FO = 1.5 


Mantissa 

1000000000000000000000000000000 X 215 Exp=0 

0010000000000000000000000000000 X2 =2.25 Exp=1 

0100010000000000000000000000000 X*4 =5.0625 Exp=2 

1001101000010000000000000000000 X*8 =25.628906 Exp=4 

0100100001101011101000001000000 X16 =656.84083 Exp=9 

1010010101010011111101110011111 X*32.- =431.43988-E3  Exp=18 
0101101010110110101000010101001 X64. =186.14037-E9 — Exp=37 
1101010110010010001010101100011 X28  =34.648238-E21 Exp=74 


MMMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 


Mantissa. —————_ 


Hand-calculated value of log2(1.5) 

log2(1.5) = 0.58496250 = 1001010 111000000 

XXXXXXX-— first 7 bits (exponent) 

mmm- quick 3 bits (mantissa) 

If you compare the hand-calculated value and the binary representation of 


log2(1.5), you find that the sequence of bits in the exponent (seven bits worth) 
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are equivalent to the seven MSBs of the logarithm. If the exponent could hold 
all the bits needed for full accuracy, then it would be possible to continue the op- 
eration for all 24 bits of the mantissa. Since there are only eight bits in the expo- 
nent and the MSBs are used for negative values, only seven iterations are pos- 
sible before the exponent must be off-loaded and reinitialized to zero. 


By concatenating EXP_new to the previous exponent, longer strings of bits can 
be built for greater accuracy. The process is then repeated until the desired accu- 
racy is achieved. Also remember that the original numbers exponent, which rep- 
resents the whole number part of the result, becomes the eight MSBs of the final 
result. 


Another technique is to look at the three MSBs of the mantissa and apply a 
roundup from the fourth bit. Those same MSBs can be used as a quick exten- 
sion of the exponent (logarithm). To visualize this, consider the following tabu- 
lated values and graph. 


Mant —_log2(Mant) 


1.000 0.000 5 

1.250 0.322 a 
1.500 0.585 © 05 

1.7500 0.807 

1.999 0.999 


ee 
4 
| 


1.0 1.5 2.0 
Mantissa 


Figure 5-22. Tabulated Values for Mantissa 
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Note: 


The fractional part is the same at the endpoints. 
a) 
In the middle, only a slight bowing exists which can either be ignored or optionally 
rounded for better accuracy. The maximum actually occurs at a mantissa value 
of 


{ 


In(2.0) 


or 1.442695. The value of log2(mant) at that point is 0.52876637, giving a maxi- 
mum error of 0.086071. 
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When finished, the bits representing the finished logarithm are in a fixed-point 
notation and need to be scaled. This is done by using the FLOAT instruction fol- 
lowed by a multiplication by a constant scaling factor. If the final result needs to 
be in any other base, the scaling factor is simply adjusted for that base. 


5.11.2 Points to Consider 


The round-off accuracy of the first three squaring operations affect the final 
result if >21 mantissa bits are desired. A RND instruction placed after the first 
three MPYF RO,RO instructions remedy this, but adds to the cycle count. 


When the input value approaches 1.0, the result is driven close to zero and 
accuracy suffers. In this case, an input range comparison and a branch to a 
McLauren series expansion is used as a solution with minimal degradation in 
speed. This is because the power series converges quickly for input values 
close to 1.0. 


If you only need to calculate a visual quality logarithm, such as in spectrum 
analysis, the logarithm can often be calculated in one cycle. In this case the 
mantissa is substituted directly into the fractional bits of the logarithm giving 
a maximum error of 0.086 (about 3.5 bits). The one cycle arises from the need 
to remove the 2’s compliment sign bit in the ’C3x’s mantissa. 
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Figure 5-23. Fast Logarithm for FFT Displays 


* 


MPYF 
MYPF 
ADDF 
ASH 
STR 


* log E.asm 


_log_E:POP 
POPF 
LDF 


Dae 
U 
4 
Ww 


GOOG 
uN 

ty 

Hy 


PreamWuvys eS ES SESE 
U 
K 
ny 


O tv 
OK 
H 4 


.data 
CONST_ADR: 
CONST 


KR KKK KKK KKK KEK KKK KR KKK KKK KKK KKK KKK KKK KKK KK KKK KR KKK KKK KKEKKKKKKKKK OK 


FAST Logarithm for FFT displays * 


i >>>> NEED ONLY ADD ONE INSTRUCTION IN MANY CASES 


KKK KKK KKK KKK KKK RK KKK KKK KKK KK KR KKK KKK KKK KKK KKK KKK KK KKK KKKKKRKKEKKKKEK 


REAL, REAL, RO 
IMAG, IMAG, R1 
R1,R0 
-1,R0 
RO,OUT 


-global_log E 


AR1 

RO 
RO,R1 
2,RC 
loop 
7,R1 
1.0,RO 
RO, RO 
RO, RO 
RO, RO 
RO, RO 
RO, RO 
RO, RO 
RO, RO 
R1 

RO 

AR1 

RO 
@CONST, RO 
1,SP 


.word CONST 
. Long 0e7317219h 
.end 


<<<< * 


; Calculate the magnitude 
; Note: sign bit is zero 


; <-One instruction logarithm! 
; scaled externally in DAC 


/ 


KEKE KKK KKK KKK KKK KKK K KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KEK KKK KKK KK KKK K KEKE KKK 


DEVICE: 


TMS320C30 7 


KRREKKKKE KKK KKK KRKEKKE KKK KKK RK KKK KEK KER KEK KKK KEKE KKK KEKE KER KEK KERR KKK KKEKEKEKRKKEKEKEEEE 


j; return address -> AR1 
; X -> RO 
; use R1 to accumulate answer 


7; repeat 3x 


; EXP = 0 

; mant*2 

; mant*4 

; mant*s 

; mant*16 

; mant*32 

; mant*64 

; mant*128 

; PUSH EXP and Mantissa (sign is now data!) 
; POP as ianteger (EXP+FRACTION) 


; convert EXP+FRACTION to float 
; scale the result by 2*-24 and change base 
; restore stack pointer 


;;Base e hand calc w/1 l1sb round 
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Chapter 6 


Addressing Modes 


The ’C3x supports five groups of powerful addressing modes. Six types of 
addressing that allow data access from memory, registers, and the instruction 
word can be used within the groups. This chapter describes the operation, 
encoding, and implementation of the addressing modes. It also discusses the 
management of system stacks, queues, and dequeues in memory. 
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Addressing Types 


6.1 Addressing Types 
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You can access data from memory, registers, and the instruction word by using 
five types of addressing: 


L 


L 


L) 


L 


Register addressing. A CPU register contains the operand. 


Direct addressing. The data address is formed by concatenating the 
eight least significant bits (LSBs) of the data-page (DP) register and the 
16 LSBs of the instruction. 


Indirect addressing. An auxiliary register contains the address of the 
operand. 


Immediate addressing. The operand is a 16-bit or 24-bit immediate value. 


PC-relative addressing. A 16-bit or 24-bit displacement to the program 
counter (PC). 


Two specialized modes are available for use in filters, FFTs, and DSP algorithms: 


L 


L 


Circular addressing. An auxiliary register is incremented/decremented 
with regards to a circular buffer boundary. 


Bit-reverse addressing. An auxiliary register is transferred to its bit- 
reversed representation that contains the address of the operand. 


6.2 Register Addressing 


Register Addressing 


In register addressing, a CPU register contains the operand, as shown in this 


example: 


ABSF RL 


; RL = [R1| 


The syntax for the CPU registers, the assembler syntax, and the assigned 
function for those registers are listed in Table 6-1. 


Table 6-1. CPU Register Address/Assembler Syntax and Function 


Register Name 


RO 
R1 
R2 
R3 
R4 
R5 
R6 
R7 
ARO 
AR1 
AR2 
AR3 
AR4 
ARS 
AR6 
AR7 


IRO 
IR1 


Machine Address 


00h 
Oih 
02h 
03h 
04h 
05h 
06h 
07h 
08h 
09h 
OAh 
OBh 


Assigned Function 


Extended-precision register 0 
Extended-precision register 1 
Extended-precision register 2 
Extended-precision register 3 
Extended-precision register 4 
Extended-precision register 5 
Extended-precision register 6 
Extended-precision register 7 
Auxiliary register 0 

Auxiliary register 1 

Auxiliary register 2 

Auxiliary register 3 

Auxiliary register 4 

Auxiliary register 5 

Auxiliary register 6 

Auxiliary register 7 
Data-page pointer 

Index register 0 

Index register 1 

Block-size register 

Active stack pointer 

Status register 

CPU/DMA interrupt-enable 
CPU interrupt flags 

I/O flags 

Repeat start address 

Repeat end address 

Repeat counter 
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6.3 Direct Addressing 


In direct addressing, the data address is formed by the concatenation of the 
eight LSBs of the data-page pointer (DP) with the 16 LSBs of the instruction 
word (expr). This results in 256 pages (64K words per page), allowing you to 
access a large address space without requiring a change of the page pointer. 
The syntax and operation for direct addressing are: 


Syntax: @expr 
Operation: address = DP concatenated with expr 


Figure 6-1 shows the formation of the data address. Example 6-1 is an instruc- 
tion example with data before and after instruction execution. 


Figure 6-1. Direct Addressing 


Instruction 
word 


pp —> 


(Data- 
page pointer) 


31 


Example 6-1. Direct Addressing 


ADDI @OBCDEh,R7 
Before Instruction After Instruction 


R7 00 0000 0000 R7 00 1234 5678 


DP 8A DP 8A 


Data memory 
8ABCDEh 1234 5678 8ABCDEh 1234 5678 
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Indirect Addressing 


6.4 Indirect Addressing 


Indirect addressing specifies the address of an operand in memory through the 
contents of an auxiliary register, optional displacements, and index registers as 
shown in Example 6-2. Only the 24 LSBs of the auxiliary registers and index 
registers are used in indirect addressing. The auxiliary register arithmetic units 
(ARAUs) perform the unsigned arithmetic on these lower 24 bits. The upper 
eight bits are unmodified. 


Example 6-2. Auxiliary Register Indirect 


An auxiliary register (ARn) contains the address of the operand to be fetched. 


Operation: operand address = ARn 
Assembler Syntax: *“ARn 
Modification Field: 11000 
31 2423 0 


ARn 


31 


The flexibility of indirect addressing is possible because the ARAUs on the 
°C3x modify auxiliary registers in parallel with operations within the main CPU. 
Indirect addressing is specified by a 5-bit field in the instruction word, referred 
to as the mod field (shown in Table 6-2). A displacement is either an explicit 
unsigned 8-bit integer contained in the instruction word or an implicit displace- 
ment of 1. Two index registers, IRO and IR1, can also be used in indirect 
addressing, enabling the use of 24-bit indirect displacement. In some cases, 
an addressing scheme using circular or bit-reversed addressing is optional. 
Generating addresses in circular addressing is discussed in Section 6.7 on 
page 6-21; bit-reversed addressing is discussed in Section 6.8 on page 6-26. 


Table 6-2 lists the various kinds of indirect addressing, along with the value 
of the modification (mod) field, assembler syntax, operation, and function for 
each. The succeeding 17 examples show the operation for each kind of indi- 
rect addressing. Figure 6-3 on page 6-20 shows the format in the instruction 
encoding. 


Addressing Modes 6-5 


Indirect Addressing 


Figure 6-2. Indirect Addressing Operand Encoding 
MSB LSB 
5 bits 3 bits 0, 5, or 8 bits 


Note: Auxiliary Register 
The auxiliary register (ARn) is encoded in the instruction word according to its 
binary representation n (for example, AR3 is encoded as 119), not its register 


machine address (shown in Table 6-1). 
| 
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Table 6-2. Indirect Addressing 


(a) Indirect addressing with displacement 


Mod Field 


00000 
00001 
00010 


00011 


00100 


00101 


00110 


00111 


Syntax 
*+ARn(disp) 
*-ARn(disp) 
*+4+ARn(disp) 


*—ARn(disp) 


*ARn++(disp) 


*ARn-—-— (disp) 


*ARn++(disp)% 


*ARn-—(disp)% 


Operation 
addr = ARn + disp 
addr = ARn - disp 


addr = ARn + disp 
ARn = ARn + disp 


addr = ARn - disp 
ARn = ARn - disp 


addr = ARn 
ARn = ARn + disp 


addr = ARn 
ARn = ARn - disp 


addr = ARn 


ARn = circ(ARn + disp) 


addr = ARn 


ARn = circ(ARn - disp) 


(b) Indirect addressing with index register [RO 


Mod Field 


01000 
01001 
01010 


01011 


01100 


01101 


01110 


01111 


Legend: 


addr 
ARn 
circ( 
disp 


Syntax 
*+ARn(IRO) 
*-ARn(IRO) 
*++ARn(IRO) 


*__ ARn(IRO) 


*ARn++(IRO) 


*ARn—-(IRO) 


*ARn++(IRO)% 


*ARn——(IRO)% 


Operation 
addr = ARn + IRO 
addr = ARn — IRO 


addr = ARn + IRO 
ARn = ARn + IRO 


addr = ARn — |IRO 
ARn = ARn —- IRO 


addr = ARn 
ARn = ARn + IRO 


addr= ARn 
ARn = ARn - IRO 


addr = ARn 


ARn = circ(ARn + IRO) 


addr = ARn 


ARn = circ(ARn- IRO) 


memory address 
auxiliary registers ARO-AR7 
) address in circular addressing 


displacement 


Indirect Addressing 


Description 
With predisplacement add 
With predisplacement subtract 


With predisplacement add and modify 


With predisplacement subtract and modify 


With postdisplacement add and modify 


With postdisplacement subtract and modify 


With postdisplacement add and circular modify 


With postdisplacement subtract and circular modify 


Description 

With preindex (IRO) add 

With preindex (IRO) subtract 

With preindex (IRO) add and modify 


With preindex (IRO) subtract and modify 


With postindex (IRO) add and modify 


With postindex (IRO) subtract and modify 


With postindex (IRO) add and circular modify 


With postindex (IRQ) subtract and circular modify 


++ add and modify 

-- subtract and modify 

% where circular addressing is performed 
IRn index register IRO or IR1 
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Table 6-2. Indirect Addressing (Continued) 


(c) Indirect addressing with index register IR1 


Mod Field Syntax 


Operation 


Description 


10000 
10001 
10010 


10011 


10100 


10101 


10110 


10114 


*4 ARn(IR1) 
*_ARn(IR1) 


*44ARn(IR1) 


*_-ARn(IR1) 


*ARn++(IR1) 


*ARn--(IR1) 


*ARn++(IR1)% 


*ARn--(IR1)% 


addr = ARn+|R1 
addr = ARn - |IR1 


addr = ARn+|IR1 
ARn = ARn + IR1 


addr = ARn - IR1 
ARn=ARn-IR1 


addr = ARn 
ARn = ARn + IR1 


addr = ARn 
ARn=ARn-IR1 


addr = ARn 
ARn = circ(ARn + IR1) 


addr = ARn 
ARn = circ(ARn - IR1) 


(d) Indirect addressing (special cases) 


Mod Field Syntax 
11000 *ARn 


11001 *ARn++ (IRO)B 


Legend: addr 
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Operation 
addr = ARn 


addr = ARn 
ARn = B(ARn + IRO) 


memory address 
auxiliary registers ARO-AR7 


where bit-reversed addressing is performed 


bit-reversed address 


With preindex (IR1) add 
With preindex (IR1) subtract 


With preindex (IR1) add and modify 


With preindex (IR1) subtract and modify 


With postindex (IR1) add and modify 


With postindex (IR1) subtract and modify 


With postindex (IR1) add and circular modify 


With postindex (IR1) subtract and circular modify 


Description 


Indirect 


With postindex (IRO) add 


and bit-reversed modify 


circ() address in circular addressing 


++ add and modify by 1 

-- subtract and modify by 1 

% where circular addressing is performed 
IRn index register IRO or IR1 


Example 6-3 through Example 6-19 show the operation for each type of indirect 
addressing. 


Indirect Addressing 


Example 6-3. Indirect Addressing With Predisplacement Add 


The address of the operand to fetch is the sum of an auxiliary register (ARn) and 
the displacement (disp). The displacement is either an 8-bit unsigned integer 
contained in the instruction word or an implied value of 1. 


Operation: operand address = ARn + disp 
Assembler Syntax: *+ARn(disp) 
Modification Field: 00000 
31 24 23 0 
31 8 7 0 
31 0 


Example 6-4. Indirect Addressing With Predisplacement Subtract 


The address of the operand to fetch is the contents of an auxiliary register 
(ARn) minus the displacement (disp). The displacement is either an 8-bit 
unsigned integer contained in the instruction word or an implied value of 1. 


Operation: operand address = ARn - disp 
Assembler Syntax: *_ARn(disp) 
Modification Field: 00001 
31 24 23 0 


Address 


31 8 7 0 
disp| 0 0...0 0 (-) 
31 0 
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Example 6-5. Indirect Addressing With Predisplacement Add and Modify 


The address of the operand to fetch is the sum of an auxiliary register (ARn) and 
the displacement (disp). The displacement is either an 8-bit unsigned integer 
contained in the instruction word or an implied value of 1. After the data is 
fetched, the auxiliary register is updated with the address generated. 


Operation: operand address = ARn + disp 
ARn = ARn + disp 
Assembler Syntax: *+4ARn (disp) 
Modification Field: 00010 
31 24 23 0 
fs] 


31 8 7 0 


31 0 


Example 6-6. Indirect Addressing With Predisplacement Subtract and Modify 
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The address of the operand to fetch is the contents of an auxiliary register 
(ARn) minus the displacement (disp). The displacement is either an 8-bit 
unsigned integer contained in the instruction word or an implied value of 1. 
After the data is fetched, the auxiliary register is updated with the address 
generated. 


Operation: operand address = ARn — disp 
ARn = ARn - disp 
Assembler Syntax: *__ARn(disp) 
Modification Field: 00011 
31 24 23 0 


31 8 7 0 


disp | 0 0...0 0 


ARn 


Indirect Addressing 


Example 6-7. Indirect Addressing With Postdisplacement Add and Modify 


The address of the operand to fetch is the contents of an auxiliary register (ARN). 
After the operand is fetched, the displacement (disp) is added to the auxiliary 
register. The displacement is either an 8-bit unsigned integer contained in the 
instruction word or an implied value of 1. 


Operation: operand address = ARn 
ARn = ARn + disp 
Assembler Syntax: *ARn++ (disp) 
Modification Field: 00100 
31 24 23 0 


ARn 


31 8 7 0 
disp| 0 0...0 0| Integer | (+) 


31 0 


Example 6-8. Indirect Addressing With Postdisplacement Subtract and Modify 


The address of the operand to fetch is the contents of an auxiliary register 
(ARn). After the operand is fetched, the displacement (disp) is subtracted from 
the auxiliary register. The displacement is either an 8-bit unsigned integer 
contained in the instruction word or an implied value of 1. 


Operation: operand address = ARn 
ARn = ARn - disp 
Assembler Syntax: *ARn—-— (disp) 
Modification Field: 00101 
31 24 23 0 


ARn 


31 8 7 0 
disp} 0 0...0 0| integer | (-) 


31 0 
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Example 6-9. Indirect Addressing With Postdisplacement Add and Circular Modify 


The address of the operand to fetch is the contents of an auxiliary register 
(ARn). After the operand is fetched, the displacement (disp) is added to the 
contents of the auxiliary register using circular addressing. This result is used 
to update the auxiliary register. The displacement is either an 8-bit unsigned 
integer contained in the instruction word or an implied value of 1. 


Operation: operand address = ARn 
ARn = circ(ARn+ disp) 
Assembler Syntax: *ARn ++ (disp)% 
Modification Field: 00110 
31 24 23 0 
31 8 7 0 (%) 
] 


31 0 


Example 6-10. Indirect Addressing With Postdisplacement Subtract and Circular Modify 


6-12 


The address of the operand to fetch is the contents of an auxiliary register 
(ARn). After the operand is fetched, the displacement (disp) is subtracted from 
the contents of the auxiliary register using circular addressing. This result is 
used to update the auxiliary register. The displacement is either an 8-bit 
unsigned integer contained in the instruction word or an implied value of 1. 


Operation: operand address = ARn 
ARn = circ(ARn — disp) 
Assembler Syntax: *ARn ——(disp)% 
Modification Field: 00111 
31 24 23 0 


ARn 


31 8 7 O (%) 
| 
disp| 0 0...0 o| Integer | C) 


31 0 
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Example 6-11. Indirect Addressing With Preindex Add 


The address of the operand to fetch is the sum of an auxiliary register (ARn) 
and an index register (IRO or IR1). 


Operation: operand address = ARn + IRm 
Assembler Syntax: *+ARn(IRm) 
Modification Field: 01000 ifm=0 


10000 ifm=1 


31 24 23 0 
31 24 23 0 
31 0 


Example 6-12. Indirect Addressing With Preindex Subtract 


The address of the operand to fetch is the difference of an auxiliary register 
(ARn) and an index register (IRO or IR1). 


Operation: operand address = ARn — IRm 
Assembler Syntax: *-ARn(IRm) 
Modification Field: 01001 ifm=0 
10001 ifm=1 
31 24 23 0 
Xx 
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Example 6-13. Indirect Addressing With Preindex Add and Modify 


The address of the operand to fetch is the sum of an auxiliary register (ARn) and 
an index register (IRO or IR1). After the data is fetched, the auxiliary register is 
updated with the generated address. 


Operation: operand address = ARn + IRm 
ARn = ARn + IRm 

Assembler Syntax: *+4+ARn(IRm) 

Modification Field: 01010 ifm =0 


10010 ifm =1 


31 24 23 0 


Example 6-14. Indirect Addressing With Preindex Subtract and Modify 


The address of the operand to fetch is the difference between an auxiliary register 
(ARn) and an index register (IRO or IR1). The resulting address becomes the new 
contents of the auxiliary register. 


Operation: operand address = ARn — IRm 
ARn = ARn - |IRm 
Assembler Syntax: *--ARn(IRm) 
Modification Field: 01011 ifm =0 
10011 ifm=1 


31 24 23 0 
X 
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Example 6-15. Indirect Addressing With Postindex Add and Modify 


The address of the operand to fetch is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO or IR1) is added 
to the auxiliary register. 


Operation: operand address = ARn 
ARn=ARn + IRm 

Assembler Syntax: *ARn++(IRm) 

Modification Field: 01100 ifm =0 


10100 ifm=1 


31 24 23 0 
ns 

31 24 23 0 
31 0 


Example 6-16. Indirect Addressing With Postindex Subtract and Modify 


The address of the operand to fetch is the contents of an auxiliary register (ARN). 
After the operand is fetched, the index register (IRO or IR1) is subtracted from 
the auxiliary register. 


Operation: operand address = ARn 
ARn = ARn - IRm 
Assembler Syntax: *ARn--(IRm) 
Modification Field: 01101 ifm = 0 
10101 ifm =1 
31 24 23 0 
ARn 
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Example 6-17. Indirect Addressing With Postindex Add and Circular Modify 


The address of the operand to fetch is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO or IR1) is added 
to the auxiliary register. This value is evaluated using circular addressing and 
replaces the contents of the auxiliary register. 


Operation: 


Assembler Syntax: 
Modification Field: 


operand address = ARn 
ARn = circ(ARn + IRm) 


*ARn++(IRM)% 


01110 ifm=0 
10110 ifm =1 
31 24 23 0 


Example 6-18. Indirect Addressing With Postindex Subtract and Circular Modify 


The address of the operand to fetch is the contents of an auxiliary register (ARN). 
After the operand is fetched, the index register (IRO or IR1) is subtracted from the 
auxiliary register. This result is evaluated using circular addressing and replaces 
the contents of the auxiliary register. 


Operation: 


Assembler Syntax: 
Modification Field: 


ARn 
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operand address = ARn 
ARn = circ(ARn — IRm) 


*ARn -- (IRm)% 


01111 ifm=0 
10111 ifm =1 
31 24 23 0 
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Example 6-19. Indirect Addressing With Postindex Add and Bit-Reversed Modify 


The address of the operand to fetch is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO) is added to the 
auxiliary register. This addition is performed with a reverse-carry propagation 
and can be used to yield a bit-reversed (B) address. This value replaces the 
contents of the auxiliary register. 


Operation: operand address = ARn 
ARn = B(ARn + IRO) 


Assembler Syntax: *ARn++(IRO)B 
Modification Field: 11001 
31 24 23 0 
ARn x x Address 
31 24 23 0 (B) 
31 0 
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6.5 Immediate Addressing 


In immediate addressing, the operand is a 16-bit (Short) or 24-bit (long) immediate 
value contained in the 16 or 24 LSBs of the instruction word (expr). Depending 
on the data types assumed for the instruction, the short-immediate operand can 
be a 2s-complement integer, an unsigned integer, or a floating-point number. This 
is the syntax for this mode: 


Syntax: expr 


Example 6-20 illustrates an instruction example with data before and after the 
instruction is executed. 


Example 6-20. Short-lmmediate Addressing 


SUBI 1,R0 
Before Instruction After Instruction 


RO | 00 0000 0000 RO | 00 FFFF FFFF 


In long-immediate addressing, the operand is a 24-bit unsigned immediate 
value contained in the 24 LSBs of the instruction word (expr). This is the syntax 
for this mode: 


Syntax: expr 


Example 6-21 illustrates an instruction example with data from before and after 
the instruction is executed. 


Example 6-21. Long-lmmediate Addressing 
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BR 8000h 
Before Instruction After Instruction 


PC 0000 8000 
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6.6 PC-Relative Addressing 


Program counter (PC)-relative addressing is used for branching. It adds the 
contents of the 16 or 24 LSBs of the instruction word to the PC register. The 
assembler takes the src (a label or address) specified by the user and generates 
a displacement. If the branch is a standard branch, this displacement is equal to 
[label — (instruction address +1)]. If the branch is a delayed branch, this displace- 
ment is equal to [label — (instruction address +3)]. 


The displacement is stored as a 16-bit or 24-bit signed integer in the LSBs of 
the instruction word. The displacement is added to the PC during the pipeline 
decode phase. Notice that because the PC is incremented by 1 in the fetch 
phase, the displacement is added to this incremented PC value. 


Syntax: expr (src) 


Example 6-22 illustrates an example with data from before and after the 
instruction is executed. 


Example 6-22. PC-Relative Addressing 


NEWPC ; pc=1001h, NEWPC label = 1005h, displacement 


Before Instruction After Instruction 


decode phase: execution phase: 


PC PC 1005 


The 24-bit addressing mode encodes the program-control instructions (for 
example, BR, BRD, CALL, RPTB, and RPTBD). Depending on the instruc- 
tion, the new PC value is derived by adding a 24-bit signed value in the 
instruction word with the present PC value. Bit 24 determines the type of 
branch (D = 0 for a standard branch or D = 1 for a delayed branch). Some of 
the instructions are encoded in Figure 6-3. 
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Figure 6-3. Encoding for 24-Bit PC-Relative Addressing Mode 
(a) BR, BRD: unconditional branches (standard and delayed) 


oO 


31 25 24 23 


(b) CALL: unconditional subroutine call 


fo} 


31 24 23 


(c) RPTB: repeat block 


oO 


31 25 24 23 
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6.7 Circular Addressing 


Many DSP algorithms, such as convolution and correlation, require a circular 
buffer in memory. In convolution and correlation, the circular buffer acts as a 
sliding window that contains the most recent data to process. As new data is 
brought in, the new data overwrites the oldest data by increasing the pointer 
to the data through the buffer in counter-clockwise fashion. When the pointer 
accesses the end of the buffer, the device sets the pointer to the beginning of 
the buffer. For example, Figure 6—4a shows a circular buffer that holds six values. 
Figure 6-4b shows how this buffer is implemented in the ‘C3x memory space. 
Figure 6-5 shows this buffer after writing three values. Figure 6-6 shows this 
buffer after writing eight values. 


Figure 6-4. Logical and Physical Representation of Circular Buffer 


a) Logical representation b) Physical representation 


Start End Start 


mre End 


Figure 6-5. Logical and Physical Representation of Circular Buffer after Writing Three Values 


a) Logical representation b) Physical representation 


End 
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Figure 6-6. Logical and Physical Representation of Circular Buffer after Writing Eight Values 


a) Logical representation b) Physical representation 


To implement a circular buffer in the ’C3x, the following criteria must be satis- 
fied (more than one circular buffer can be implemented on the ’C3x as long as 
the size of the buffers are identical): 


1 Specify the size of the circular buffer (R) by storing the length of the buffer 
in the block-size register (BK). The size of the buffer must be less than or 
equal to 64K (16 bits). 


(1 Align the start of the buffer to a K-bit boundary (that is, the K LSBs of the 
starting address of the circular buffer must be 0) by satisfying the following 
formula: 


2KSR 
where: 


R= length of circular buffer 
K= number of Os in the LSBs of the circular buffer starting address 


Example 6-23. Examples of Formula 2K > R 


Length of Buffer BK Register Value Starting Address of Buffer 


31 31 XXXXXXXXXXXXXXXXXXXO000V0S 
32 32 XXXXXXXXXXXXXXXXXX000000 
1024 1024 XXXXXXXXXXXXX000000000005 
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In circular addressing, index refers to the K LSBs (from the K-bit boundary criteria) 
of the auxiliary register selected, and step is the quantity being added to or 
subtracted from the auxiliary register. Follow these two rules when you use cir- 
cular addressing: 


J The step used must be less than or equal to the block size. The step size 
is treated as an unsigned integer. If an index register (IR) is used as a step 
increment or decrement, it is also treated as an unsigned integer. 


The first time the circular queue is addressed, the auxiliary register must 
be pointing to an element in the circular queue. 


The algorithm for circular addressing is as follows: 


If 0 <index + step < BK: index = index + step. 
Else if index + step > BK: index = index + step — BK. 
Else if index + step < 0: index = index + step + BK. 


Figure 6-7 shows how the circular buffer is implemented and illustrates the 
relationship of the quantities generated and the elements in the circular buffer. 


Figure 6-7. Circular Buffer Implementation 


Address Data 


31 K K-1 0 Top of circular buffer 


Effective base (EB) PoH.H | 0...o =e Element 0 
MSBs of ARn Element 1 


Auxiliary register (ARn) > Element (K LSBs of ARn) 


31 K K-1 0 Last element 
LSBs BK > Last element + 1 


MSBs of ARn 


Example 6-24 shows circular addressing operation. Assuming that all ARs 
are four bits, let ARO = 0000 and BK = 0110 (block size of 6). Example 6-24 
shows a sequence of modifications and the resulting value of ARO. 
Example 6-24 also shows how the pointer steps through the circular queue 
with a variety of step sizes (both incrementing and decrementing). 
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Example 6-24. Circular Addressing 


*ARO+4(5)% 
*ARO+4(2)% 
*ARO——(3)% 
*ARO+4(6)% 
*ARO——% 
*ARO 


Sth — 
4th, 3rd — 


ist 


; ARO = 


; ARO 
; ARO 
; ARO 
; ARO 


; ARO = 


ofp RH OO 


0 value) 

1st value) 
2nd value) 
3rd value) 
4th value) 
5th value) 


SS a ee 


Data Address 


Element 0 0 
Element 1 1 
Element 2 


Element 3 


Element 5 (last element) 


2 
3 
5 
6 


Last element + 1 


Circular addressing is especially useful for the implementation of FIR filters. 
Figure 6-8 shows one possible data structure for FIR filters. Note that the ini- 
tial value of ARO points to h(N-1), and the initial value of AR1 points to x(0). 
Circular addressing is used in the ’C8x code for the FIR filter shown in 
Example 6-25. 


Figure 6-8. Data Structure for FIR Filters 
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Impulse response 


Input samples 
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Example 6-25. FIR Filter Code Using Circular Addressing 


Impulse Response 


.sect “Impulse Resp” 


.float 1.0 
.float 0.99 
-float 0.95 


-float 0.1 


Input Buffer 


-usect “Input_Buf”,128 


.data 


HADDR .word H 
XADDR .word X 


N 


TOP 


.word 128 
Initialization 

LDP HADDR 

LDI @N, BK 

LDI @HADDR, ARO 

LDI @XADDR,AR1 


LDF IN, R3 
STF R3,*AR1++% 


Filter 


RPTS N-1 


; Load block size. 

; Load pointer to impulse re- 
; sponse. 

; Load pointer to bottom of 
; input sample buffer. 


;Read input sample. 

;Store with other samples, 
jand point to top of buffer. 
; Initialize RO. 

;Initialize R2. 


;Repeat next instruction. 


MPYF3 *ARO++%,*AR1++%,RO 


ADDF3 RO,R2,R2 
ADDF RO,R2 


STF R22, ¥ 
B TOP 


;Multiply and accumulate. 
;Last product accumulated. 


;Save result. 
;Repeat. 
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6.8 Bit-Reversed Addressing 


6-26 


The ’C3x can implement fast Fourier transforms (FFT) with bit-reversed ad- 
dressing. Whenever data in increasing sequence order is transformed by an 
FFT, the resulting data is presented in bit-reversed order. To recover this data 
in the correct order, certain memory locations must be swapped. By using the 
bit-reversed addressing mode, swapping data is unnecessary. The data is ac- 
cessed by the CPU in bit-reversed order rather than sequentially. For correct 
bit-reversed access, the base address of bit-reversed access, the base ad- 
dress must be located on a boundary given by the size of the FFT table. Similar 
to circular addressing, the base address of bit-reversed addressing must fol- 
low this criteria. 


[1 Base address must be aligned to a K-bit boundary (that is, the K LSBs of 
the starting address of the buffer/table must be 0) as follows: 


2KSR 
where: 


R= length of table/buffer 
K= number of Os in the LSBs of the buffer/table starting address 


Lj Size of the buffer/table must be less than or equal to 64K (16 bits) 


The CPU bit-reversed operation can be illustrated by assuming an FFT table 
of size 2". When real and imaginary data are stored in separate arrays, the n 
LSBs of the base address must be 0, and IRO must be equal to 29-1 (half of 
the FFT size). When real and imaginary data are stored in consecutive 
memory locations (Realg, Imaginaryg, Real;, Imaginary;, Realo, Imaginaryo, 
etc.), the n+1 LSBs of the base address must be 0, and IRO must be equal to 
2” (FFT size). 


For CPU bit-reversed addressing, one auxiliary register points to the physical 
location of data. Adding IRO (in bit-reversed addressing) to this auxiliary register 
performs a reverse-carry propagation. IRO is treated as an unsigned integer. 


To illustrate bit-reversed addressing, assume 8-bit auxiliary registers. Let AR2 
contain the value 0110 0000 (96). This is the base address of the data in 
memory assuming a 16-entry table. Let IRO contain the value 0000 1000 (8). 
Example 6-26 shows a sequence of modifications of AR2 and the resulting 
values of AR2. 
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Example 6-26. Bit-Reversed Addressing 


*AR2++(IRO)B; AR2= 0110 0000 (0th value 
*AR2++(IRO)B; AR2= 0110 1000 (1st value 
*AR2++(IRO)B; AR2= 0110 0100 (2nd value 
*AR2++(IRO)B; AR2= 0110 1100 (3rd value 
*AR2++(IRO)B; AR2= 0110 0010 (4th value 
*AR2++(IRO)B; AR2= 0110 1010 (5th value 
*AR2++(IRO)B; AR2= 0110 0110 (6th value 
*AR2 ; AR2= 0110 1110 (7th value 


Table 6-3 shows the relationship of the index steps and the four LSBs of AR2. 
You can find the four LSBs by reversing the bit pattern of the steps. 


Table 6-3. Index Steps and Bit-Reversed Addressing 


Bit-Reversed Bit-Reversed 
Step Bit Pattern Pattern Step 
0 0000 0000 0 
1 0001 1000 8 
2 0010 0100 4 
3 0011 1100 12 
4 0100 0010 2 
5 0101 1010 10 
6 0110 0110 6 
7 0111 1110 14 
8 1000 0001 1 
9 1001 1001 9 
10 1010 0101 5 
11 1011 1101 13 
12 1100 0011 3 
13 1101 1011 11 
14 1110 0111 7 


15 11114 1111 15 
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6.9 Aligning Buffers With the TMS320 Floating-Point DSP Assembly 
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Language Tools 


To align buffers to a K-bit boundary, you can use the .sect or .usect assembly 
directives to define a section in conjunction with the align memory allocation 
parameter of the sections directive of the linker command file. For the FIR filter 
of Example 6-25 with a length of 32, the linker command file is: 


MEMORY 


{ 


RAM origin = Oh, length = 1000h 
SECTIONS 
.text: > RAM 


Impulse Resp ALIGN (64) : 
Input_Buf ALIGN (64) : 


RAM 


> 
> RAM 


System and User Stack Management 


6.10 System and User Stack Management 


The ’C3x provides a dedicated system-stack pointer (SP) for building stacks 
in memory. The auxiliary registers can also be used to build a variety of more 
general linear lists. This section discusses the implementation of the following 
types of linear lists: 


L] Stack 


The stack is a linear list for which all insertions and deletions are made at 
one end of the list. 


LJ Queue 


The queue is a linear list for which all insertions are made at one end of the 
list and all deletions are made at the other end. 


(7 Dequeue 


The dequeue is a double-ended queue linear list for which insertions and 
deletions are made at either end of the list. 


6.10.1 System-Stack Pointer 


The system-stack pointer (SP) is a 32-bit register that contains the address of 
the top of the system stack. The system stack fills from low-memory address 
to high-memory address (see Figure 6-9). The SP always points to the last 
element pushed onto the stack. A push performs a preincrement, and a pop 
performs a postdecrement of the system-stack pointer. 


The program counter is pushed onto the system stack on subroutine calls, 
traps, and interrupts. It is popped from the system stack on returns. The sys- 
tem stack can be pushed and popped using the PUSH, POP, PUSHF, and 
POPF instructions. 


Figure 6-9. System Stack Configuration 


Low memory 


Bottom of stack 


High memory 
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6.10.2 Stacks 


Stacks can be built from low to high memory or high to low memory. Two cases 
for each type of stack are shown. Stacks can be built using the preincrement/ 
decrement and postincrement/decrement modes of modifying the auxiliary 
registers (AR). Stack growth from high-to-low memory can be implemented in 
two ways: 


CASE 1: Stores to memory using *-—ARn to push data onto the stack and 
reads from memory using *~ARn++ to pop data off the stack. 


CASE 2: Stores to memory using *~ARn--—to push data onto the stack and 
reads from memory using * ++ARn to pop data off the stack. 


Figure 6-10 illustrates these two cases. The only difference is that in case 1, 
the AR always points to the top of the stack, and in case 2, the AR always points 
to the next free location on the stack. 


Figure 6-10. Implementations of High-to-Low Memory Stacks 
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Case 1 Case 2 
Low memory Low memory 


ARn 
ARn ~ Top of stack Top of stack 


Bottom of stack 


Bottom of stack 


High memory High memory 


Stack growth from low-to-high memory can be implemented in two ways: 


CASE 3: Stores to memory using *++ARn to push data onto the stack and 
reads from memory using *ARn--—to pop data off the stack. 


CASE 4: Stores to memory using *ARn++ to push data onto the stack and 
reads from memory using *- —ARn to pop data off the stack. 


Figure 6-11 shows these two cases. In case 3, the AR always points to the top 
of the stack. In case 4, the AR always points to the next free location on the stack. 
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Figure 6—11.lmplementations of Low-to-High Memory Stacks 


6.10.3 Queues 


Case 3 Case 4 
Low memory Low memory 


Bottom of stack Bottom of stack 


ARn — Top of stack 


High memory High memory 


A queue is like a FIFO. The implementation of queues is based on the manipu- 
lation of auxiliary registers. Two auxiliary registers are used: one to mark the 
front of the queue from which data is popped (or dequeued) and the other to 
mark the rear of the queue where data is pushed. With proper management 
of the auxiliary registers, the queue can also be circular. (A queue is circular 
when the rear pointer is allowed to point to the beginning of the queue memory 
after it has pointed to the end of the queue memory.) 
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Program Flow Control 


The TMS320C3x provides a complete set of constructs that facilitate software 
and hardware control of the program flow. Software control includes repeats, 
branches, calls, traps, and returns. Hardware control includes reset operation, 
interrupts, and power management. You can select the constructs best suited 
for your particular application. 
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7.1 Repeat Modes 


The repeat modes of the ’C3x can implement zero-overhead looping. For many 
algorithms, most execution time is spent in an inner kernel of code. Using the 
repeat modes allows these time-critical sections of code to be executed in the 
shortest possible time. 


The 'C3x provides two instructions to support zero-overhead looping: 


1 RPTB (repeat a block of code). RPTB repeats execution of a block of code 
a specified number of times. 


_) RPTS (repeat a single instruction). RPTS fetches a single instruction once 
and then repeats its execution a number of times. Since the instruction is 
fetched only once, bus traffic is minimized. 


RPTB and RPTS are 4-cycle instructions. These four cycles of overhead occur 
during the initial execution of the loop. All subsequent executions of the loop 
have no overhead (0 cycle). 


Three registers (RS, RE, and RC) control the updating of the program-counter 
(PC) when it is modified in a repeat mode. Table 7-1 describes these registers. 


Table 7-1. Repeat-Mode Registers 
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Register Function 


RS Repeat start-address register. Holds the address of the first instruction 
of the block of code to be repeated. 


RE Repeat end-address register. Holds the address of the last instruction 
of the block of code to be repeated. RE = RS (see subsection 7.1.2). 


RC Repeat-counter register. Contains 1 less than the number of times 
the block remains to be repeated. For example, to execute a block 
ntimes, load n—1 into RC. 


Correct operation of the repeat modes requires that all of the above registers 
must be initialized correctly. RPTB and RPTS perform this initialization in 
slightly different ways. 


Repeat Modes 


7.1.1. Repeat-Mode Control Bits 
Two bits are important to the operation of RPTB and RPTS: 


L) RM bit. The repeat-mode (RM) flag bit in the status register specifies 
whether the processor is running in the repeat mode. 


m RM =O: Fetches are not made in repeat mode. 
m RM = 1: Fetches are made in repeat mode. 


1 S bit. The S bit is internal to the processor and cannot be programmed, 
but this bit is necessary to fully describe the operation of RPTB and RPTS. 


m If RM=1andS =O, RPTB is executing. Program fetches occur from 
memory. 

m If RM =1 and S = 1, RPTS is executing. After the first fetch from 
memory, program fetches occur from the instruction register. 


7.1.2 Repeat-Mode Operation 


Information in the repeat-mode registers and associated control bits controls 
the modification of the PC during repeat-mode fetches. The repeat modes 
compare the contents of the RE register (repeat-end-address register) with the 
PC after the execution of each instruction. If they match and the repeat counter 
(RC) is nonnegative, the RC is decremented; the PC is loaded with the repeat- 
start-address, and the processing continues. The fetches and appropriate 
status bits are modified as necessary. Note that the RC is never modified 
when the RM flag is 0. 


The repeat counter should be loaded with a value 1 less than the number of 
times to execute the block; for example, an RC value of 4 executes the block 
five times. The detailed algorithm for the update of the PC is shown in 
Example 7-1. 


ho. EE 
Note: Maximum Number of Repeats 


1) The maximum number of repeats occurs when RC = 8000 0000h. This 
results in 8000 0001h repetitions. The minimum number of repeats occurs 
when RC = 0. This results in one repetition. 


2) RE must be greater than or equal to RS (RE = RS). Otherwise, the code 
does not repeat even though the RM bit remains set to 1. 


3) By writing a 0 into the repeat counter or writing 0 into the RM bit of the 


status register, you can stop the repeating of the loop before completion. 
EEE EE EE EEE ___S=______=={_=_a=a=____—=S==—_—_— 
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Example 7-1. Repeat-Mode Control Algorithm 


if first time through 
fetch instruction from memory 
else 
fetch instruction from IR 
RC - 1 74 RC 


if RC < 0 
0 — ST(RM) 
o- Ss 
PC + 1-— PC 
else if S == 0 
fetch instruction from memory 
if PC == RE 
RC - 1 4 RC 
if RC 20 
RS — PC 
else if RC < 0 
0 — ST(RM) 
o- s 


PE ee 2 = PC 


If in repeat mode (RPTB or RPTS) 
If RPTS 

If this is the first fetch 
Fetch instruction from memory 
If not the first fetch 

Fetch instruction from IR 
Decrement RC 

If RC is negative 

Repeat single mode completed 
Turn off repeat-mode bit 
Clear S 

Increment PC 

If RPTB 

Fetch instruction from memory 
If this is the end of the block 
Decrement RC 

If RC is not negative 

Set PC to start of block 

If RC is negative 

Turn off repeat mode bits 
Clear S 

Increment PC 


7.1.3. RPTB Instruction 


The RPTB instruction repeats a block of code a specified number of times. 


The number of times to repeat the block is the RC (repeat count) register value 
plus 1. Because the execution of RPTB does not load the RC, you must load this 
register yourself. The RC register must be loaded before the RPTB instruction is 
executed. A typical setup of the block repeat operation is shown in Example 7-2. 


Example 7-2. RPTB Operation 


LDI 15,RC 
RPTB  ENDLOOP 
STLOOP 
ENDLOOP 


; Load repeat counter with 15 
; Execute the block of code 

; from STLOOP to ENDLOOP 16 

; times 
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All block repeats initiated by RPTB can be interrupted. When RPTB src 
(source) instruction executes, it performs the following sequence: 


1) Load the start address of the block into repeat-start-address register (RS). 
This is the next address following the instruction: 
RS + PC (program-counter) of RPTB + 1 


2) Load the end address of the block into repeat-end-address register (RE). 


[] In PC-relative mode, the end address is the 24-bit src operand plus 
RS: 


RE + src+ PC of RPTB + 1 


(1 In register mode, the end address is the contents of the src register: 


RE + src register 


3) Set the status register to indicate the repeat-mode operation. 
RM+ 1 


4) Indicate repeat-mode operation by clearing the S bit. 
S$+0 


ae | 


Note: 


You can stop the loop from repeating before its completion by writing a 0 to 
the repeat counter or writing a 0 to the RM bit of the status register. 


| | 


7.1.4 RPTS Instruction 


An RPTS src instruction repeats the instruction following the RPTS (src + 1) 
times. Repeats of a single instruction initiated by RPTS are not interruptible, 
because the RPTS fetches the instruction word only once and then keeps it 
in the instruction register for reuse. An interrupt in this situation would cause 
the instruction word to be lost. Refetching the instruction word from the instruction 
register reduces memory accesses and, in effect, acts as a one-word program 
cache. If you need a single instruction that is repeatable and interruptible, you can 
use the RPTB instruction. 


When RPTS src is executed, the following sequence of operations occurs: 


1) PC+1—-RS 

2) PC+1—7RE 

3) 1-— RM status register bit 

4) 1->Sbit 

5) src— RC (repeat count register) 
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The RPTS instruction loads all registers and mode bits necessary for the opera- 
tion of the single-instruction repeat mode. Step 1 loads the start address of the 
block into RS. Step 2 loads the end address into the RE (end address of the 
block). Since this is a repeat of a single instruction, the start address and the end 
address are the same. Step 3 sets the status register to indicate the repeat 
mode of operation. Step 4 indicates that this is the repeat single-instruction 
mode of operation. Step 5 loads src into RC. 


7.1.5 Repeat-Mode Restrictions 


Because the block-repeat modes modify the program counter, no other 
instruction can modify the program counter at the same time. Two rules apply: 


Rule 1: The last instruction in the block (or the only instruction in a block of 
size 1) cannot be a Bcond, BR, DBcond, CALL, CALLcond, TRAP- 
cond, RETIcond, RETScond, IDLE, RPTB, or RPTS. Example 7-3 
shows an incorrectly placed standard branch. 


Rule 2: None of the last four instructions from the bottom of the block (or the 
only instruction in a block of size 1) can be a BconaD, BRD, or 
DBcondD. Example 7-4 shows an incorrectly placed delayed 
branch. 


If either of these rules is violated, the PC is undefined. 


Example 7-3. Incorrectly Placed Standard Branch 
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LDI 15,RC ; Load repeat counter with 15 
RPTB ENDLOOP ; Execute the block of code 
STLOOP ; from STLOOP to ENDLOOP 16 


; times 
ENDLOOP BR OOPS ; This branch violates rule 1 


Repeat Modes 


Example 7-4. Incorrectly Placed Delayed Branch 


LDI 15,RC 
RPTB  ENDLOOP 


Load repeat counter with 15 
Execute block of code 


eT 


STLOOP from STLOOP to ENDLOOP 16 
times 
BRD OOPS ; This branch violates rule 2 
ADDF 
MPYF 


ENDLOOP SUBF 


7.1.6 RC Register Value After Repeat Mode Completes 


For the RPTB instruction, the RC register normally decrements to 0000 0000h 
unless the block size is 1; in which case, it decrements to FFFF FFFFh. However, 
if the RPTB instruction using a block size of 1 has a pipeline conflict in the instruc- 
tion being executed, the RC register decrements to 0000 0000h. Example 7-5 
illustrates a pipeline conflict. See Chapter 8 for pipeline information. 


RPTS normally decrements the RC register to FFFF FFFFh. However, if the 
RPTS has a pipeline conflict on the last cycle, the RC register decrements to 
0000 0000h. 


TT | 


Note: Number of Repetitions 


In any case, the number of repetitions is always RC + 1. 
ee 


Example 7-5. Pipeline Conflict in an RPTB Instruction 


EDC 


ENDLOOP 


.word 40000000h ; The program is located in 4000000Fh 
LDP EDC 

LDI @EDC, ARO 

LDI 15,RC ; Load repeat counter with 15 


RPTB ENDLOOP 


LDI 


; Execute block of code 

*ARO,RO ; The *ARO read conflicts with 
; the instruction fetching 

; Then RC decrements to 0 

; If cache is enabled, RC decrements 


; to FFFF FFFFh 
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7.1.7 Nested Block Repeats 


7-8 


Block repeats (RPTB) can be nested. Since the registers RS, RE, RC, and 
ST control the repeat-mode status, these registers must be saved and restored 
in order to nest block repeats. For example, if you write an interrupt service routine 
that requires the use of RPTB, it is possible that the interrupt associated with the 
routine may occur during another block repeat. The interrupt service routine can 
check the RM bit to determine whether the block repeat mode is active. If this RM 
is set, the interrupt routine must save ST, RS, RE, and RC, in that order. The inter- 
rupt routine can then perform a block repeat. Before returning to the interrupted 
routine, the interrupt routine must restore RC, RE, RS, and ST, in that order. If the 
RM bit is not set, you do not need to save and restore these registers. 


a | 


Note: Saving/Resioring Registers in Correct Order 


The order in which the registers are saved/restored is important to guarantee 
correct operation. The ST register must be restored last, after the RC, RE, 
and RS registers. ST must be restored after restoring RC, because the RM 
bit cannot be set to 1 if the RC register is 0 or —1. For this reason, if you 
execute a POP ST instruction (with ST (RM bit) = 1) while RC = 0, the POP 
instruction recovers all the ST register bits but not the RM bit that stays at 0 
(repeat mode disabled). Also, RS and RE must be correctly set before you 


activate the repeat mode. 
| 


The RPTS instruction can be used in a block repeat loop if the proper registers 
are saved. 


Because the program counter is modified at the end of the loop according to the 
contents of registers RS, RE, and RC, no operation should attempt to modify 
the repeat counter or the program-counter to a different value at the end of the 
loop. It takes four cycles in the pipeline to save and restore these registers. 
Hence, sometimes, it may be more economical to implement a nested loop by 
the more traditional method of using a register as a counter and then using a 
delayed branch or a decrement and branch-delayed instructions, rather than 
using nested repeat blocks. Often implementing the outer loop as a counter and 
the inner loop as RPTB instruction produces the fastest execution. 


Delayed Branches 


7.2 Delayed Branches 


The ’C3x offers three main types of branching: standard, delayed, and condi- 
tional delayed. 


Standard branches empty the pipeline before performing the branch, ensuring 
correct management of the program counter and resulting in a ’C3x branch 
taking four cycles. Included in this class are repeats, calls, returns, and traps. 


Delayed branches on the ’C3x do not empty the pipeline, but rather execute 
the next three instructions before the program counter is modified by the 
branch. This results in a branch that requires only a single cycle, making the 
speed of the delayed branch very close to that of the optimal block repeat 
modes of the ’C3x. However, unlike block-repeat modes, delayed branches 
may be used in situations other than looping. Every delayed branch has a stan- 
dard branch counterpart that is used when a delayed branch cannot be used. 
The delayed branches of the ’'C3x are BcondD, BRD, and DBcondD. 


Conditional delayed branches use the conditions that exist at the end of the 
instruction immediately preceding the delayed branch. They do not depend on 
the instructions following the delayed branch. The condition flags are set by a 
previous instruction only when the destination register is one of the extended- 
precision registers (RO—R7) or when one of the compare instructions (CMPF, 
CMPF83, CMPI, CMPI3, TSTB, or TSTB3) is executed. Delayed branches 
guarantee that the next three instructions will execute, regardless of other 
pipeline conflicts. 


When a delayed branch is fetched, it remains pending until the three subsequent 
instructions are executed. The following instructions cannot be used in the three 
instructions after a delayed branch (see Example 7-6): 


Bcond DBcondD 
BcondD IDLE 
IDLE2 
BR RETIcond 
BRD RETScond 
CALL RPTB 
CALLcond RPTS 
DBcond TRAPcond 


Delayed branches disable interrupts until the completion of the three instruc- 
tions that follow the delayed branch regardless of whether the branch is or is 
not performed. 


SSS Sw SS S420 _[———aaj__a>_——saS_ ——_—_—aU 500 vw eee 


Note: Incorrect Use of Delayed Branches 


If delayed branches are used incorrectly, the PC is undefined. 
SS eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeEeEeEeEeEeEeEeEeEeEeEeEA~Ey7™RBa»aR»—e 
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Example 7-6. Incorrectly Placed Delayed Branches 


Bil: BD Li 


B2: B L2 ; This branch is incorrectly placed. 


For faster execution, it might still be advantageous to use a delayed branch 
followed by NOP instructions by trading increased program size for faster 
speed. This is shown in Example 7—7 where a NOP takes the place of the third 
unused instruction after the delayed branch. 


Example 7-7. Delayed Branch Execution 


* TITLE DELAYED BRANCH EXECUTION 


LDF* +AR1(5),R2 ; Load contents of memory to R2 


BGED SKIP ; If loaded number >=0, branch 
; (delayed) 
LDFN R2,R1 ; If loaded number <0, load it to R1 
SUBF 3.0,R1 ; Subtract 3 from R1 
NOP ; Dummy operation to complete delayed 
; branch 
MPYF 1.5,R1 ; Continue here if loaded number <0 
SKIP LDF R1,R3 ; Continue here if loaded number >=0 
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7.3 Calls, Traps, and Returns 


Calls and traps provide a means of executing a subroutine or function while 
providing a return to the calling routine. 


The CALL, CALLcond, and TRAP cond instructions store the value of the PC 
on the stack before changing the PC’s contents. The RETScond or RETIcond 
instructions use the value on the stack to return execution from traps and calls. 
CALL is a 4-cycle instruction, while CALLcond and TRAP cond are 5-cycle 
instruction. 


(J The CALL instruction places the next PC value on the stack and places 
the src (Source) operand into the PC. The src is a 24-bit immediate value. 
Figure 7-1 shows CALL response timing. 


(J The CALLcond instruction is similar to the CALL instruction except for two 
differences: 


m It executes only if a specific condition is true (the 20 conditions— 
including unconditional—are listed in Table 13-12 on page 13-30). 


m The srcis either a PC-relative displacement or is in register-addres- 
sing mode. 


The condition flags are set by a previous instruction only when the destination 
register is one of the extended-precision registers (RO-R7) or when one of 
the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) 
is executed. 


(J) The TRAPcond instruction also executes only if a specific condition is true 
(same conditions as for the CALLcond instruction). When executing, the 
following actions occur: 


1) Interrupts are disabled with 0 written to bit GIE of the ST. 
2) The next PC value is stored on the stack. 


3) The specified vector is retrieved from the trap-vector table and is loaded 
into the PC. The vector address corresponds to the trap number in the 
instruction. 


Using the RETIcond to return reenables interrupts by setting the bit field of 
the status register. 


(J RETScond returns execution from any of the above three instructions by 
popping the top of the stack to the PC. For RETScond to execute, the 
specified condition must be true. The conditions are the same as for the 
CALLcond instruction. 
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Lj) RETIcond returns from traps or calls like the RETScond, with the addition 
that RETIcond also sets the GIE bit of the status register, which enables 
all interrupts whose enabling bit is set to 1. The conditions for RETIcond 
are the same as for the CALLcond instruction. 


Functionally, calls and traps accomplish the same task — a subfunction is 
called and executed, and control is then returned to the calling function. Traps 
offer two advantages over calls: 


(J Interrupts are automatically disabled when a trap is executed. This allows 
critical code to execute without risk of being interrupted. Traps are generally 
terminated with a RETIcond instruction to reenable interrupts. 


Lj) You can use traps to indirectly call functions. This is particularly benefi- 
cial when a kernel of code contains the basic subfunctions to be used by 
applications. In this case, you can modify the functions in the kernel and 
relocate them without recompiling each application. 


Figure 7-1. CALL Response Timing 


Fetch CALL Decode CALL Read CALL Execute CALL Fetch first 
(store PC subroutine 
| | | on stack) instruction | 


a FF &~F &SF FTF &F & | 
H1 NF STF YF BY BY KS 
First instruction 
A 
Data = i 
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7.4 Interlocked Operations 


One of the most common parallel processing configurations is the sharing of 
global memory by multiple processors. For multiple processors to access this 
global memory and share data in a coherent manner, some sort of arbitration 
or handshaking is necessary. This requirement for arbitration is the purpose 
of the ’C3x interlocked operations. 


The ’C3x provides a flexible means of multiprocessor support with five instruc- 
tions, referred to as interlocked operations. Through the use of external signals, 
these instructions provide powerful synchronization mechanisms. They also 
guarantee the integrity of the communication and result in a high-speed opera- 
tion. The interlocked-operation instruction group is listed in Table 7-2. 


Table 7-2. Interlocked Operations 


Mnemonic Description Operation 
LDFI Load floating-point value into a register, Signal interlocked 
interlocked src — dst 
LDII Load integer into a register, interlocked Signal interlocked 
src — dst 
SIGI Signal, interlocked Signal interlocked 
Clear interlock 
STF Store floating-point value to memory, Clear interlock 
interlocked src — dst 
STII Store integer to memory, interlocked Clear interlock 
src — dst 


The interlocked operations use the two external flag pins, XFO and XF1. XFO 
must be configured as an output pin; XF1 is an input pin. When configured in 
this manner: 


_} XFO signals an interlock operation request 


_j XF1 acts as an acknowledge signal for the requested interlocked operation. 
In this mode, XFO and XF1 are treated as active-low signals. 


The external timing for the interlocked loads and stores is the same as for 
standard loads and stores. The interlocked loads and stores may be extended 
like standard accesses by using the appropriate ready signal (RDYjp;¢ or 
XRDY int). (RDYjn¢ and XRDY;p; are a combination of external ready input and 
software wait states. see Chapter 9, External Memory Interface, for more 


information on ready generation.) 
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The LDFI and LDII instructions perform the following actions: 


1) Simultaneously set XFO to 0 and begin a read cycle. The timing of XFO is 
similar to that of the address bus during a read cycle. 


2) Execute an LDF or LDI instruction and extend the read cycle until XF1 
is set to 0 and a ready (RDYjnt or XRDYjpj) is signaled. The read cycle 
completes one H1/H3 cycle after the XF1 signal is detected. 


3) Leave XFO set to 0 and end the read cycle. 


eS | 


Note: Timing Diagrams for LDFI and LDII 


The timing diagrams for LDFI and LDII shown on the data sheets depict a 0 
wait state read cycle. Since the read cycle is extended for one H1/H3 cycle 
after XF1 signal is detected, the data sheets show the XF1 signal sampled 
one H1/H3 cycle before setting the XFO signal low. 


For the sequence of steps described here, the read cycle finishes one H1/H3 


cycle after the XF1 signal is detected. 
a | 


The read/write operation is identical to any other read/write cycle except for 
the special use of XFO and XF1. The src operand for LDFI and LDII is always 
a direct or indirect memory address. XFO is set to 0 only if the src is located 
off chip; that is, STRB, STRBO, STRB1, MSTRB, or IOSTRB is active, or the 
src is one of the on-chip peripherals. If on-chip memory is accessed, then XFO 
is not asserted, and the operation executes as an LDF or LDI from internal 
memory. 


The STFI and STII instructions perform the following operations: 


1) Simultaneously set XFO to 1 and begin a write cycle. The timing of XFO is 
similar to that of the address bus during a write cycle. 


2) Execute an STF or STI instruction and extend the write cycle until a ready 
(RDYjn¢ Or XRDYjnz) is signaled. 


As in the case for LDFI and LDII, the dst of STFI and STII affects XFO. If dst 
is located off chip (STRB, STRBO, STRB1, MSTRB, or IOSTRB is active) or 
the dst is one of the on-chip peripherals, XFO is set to 1. If on-chip memory is 
accessed, then XFO is not asserted and the operation executes as an STF or 
STI to internal memory. 


The SIGI instruction functions as follows: 


1) Sets XFO to 0 
2) Idles until one H1/H3 cycle after XF1 is set to 0 
3) Sets XFO to 1 and ends the operation 


Interlocked Operations 


ae | 


Note: Timing Diagrams for SIGI 


The timing diagrams for SIGI shown in the data sheets depict a zero wait 
state condition. Since the device idles until one cycle after XF1 is signaled, 
the data sheets show the XF1 signal sampled one H1/H3 cycle before setting 
the XFO signal low. 


For the sequence of steps described here, the device idles past one H1/H3 
cycle after the XF1 signal is detected. 
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7.4.1 Interrupting Interlocked Operations 


While the LDFI, LDII, and SIGI instructions are waiting for XF1 to be set to 0, 
you can interrupt them. LDFI and LDII require a ready signal (RDYjn} or 
XRDY jn) in order to be interrupted. Because interrupts are taken on bus-cycle 
boundaries (see Section 7.6 on page 7-26), an interrupt may be taken any time 
after a valid ready. If the interrupted LDFI or LDII consists of a multicycle load, 
the load stops and an unknown value might be loaded into the register. 


Interrupting an LDFI, LDII, or SIGI instruction allows you to implement 
protection mechanisms against deadlock conditions by interrupting an 
interlocked load that has taken too long. Upon return from the interrupt, the 
next instruction is executed. The STFI and STII instructions are not 
interruptible. Since the STFI and STII instructions complete when ready is 
signaled, the delay until an interrupt can occur is the same as for any other 
instruction. 


7.4.2 Using Interlocked Operations 
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Note: Incorrect Use of Interlock Instructions 


Do not place an STFI or STII back-to-back with an LDFI, LDII, or SIGI instruction 
as follows: 


STFI R1, *AR1L ; 
LDFI *AR1, R2 ; Incorrect use of interlock instructions 


See Section 7.4.3, Pipeline Effects of Interlocked Instructions, on page 7-19. 
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Interlocked operations can be used to implement a busy-waiting loop, to manipu- 
late a multiprocessor counter, to implement a simple semaphore mechanism, or 
to perform synchronization between two ’C3x devices. The following examples 
illustrate the usefulness of the interlocked operations instructions. 
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Example 7-8 shows the implementation of a busy-waiting loop. If location 
LOCK is the interlock for a critical section of code, and a nonzero means the 
lock is busy, the algorithm for a busy-waiting loop can be used as shown. 


Example 7-8. Busy-Waiting Loop 


LDI 1,R0 ; Put 1 into RO 

L1: LDII @LOCK, R1 Interlocked operation begun 
Contents of LOCK —4 R1 

Put RO (= 1) into LOCK, XFO = 1 
Interlocked operation ended 
Keep trying until LOCK = 0 


STII RO, @LOCK 


BNZ Li 


Example 7-9 shows how a location COUNT may contain a count of the number 
of times a particular operation must be performed. This operation may be per- 
formed by any processor in the system. If the count is 0, the processor waits 
until it is nonzero before beginning processing. The example also shows the 
algorithm for modifying COUNT correctly. 


Example 7-9. Multiprocessor Counter Manipulation 


CT's OR 4,I0F 7 AEOCS 2 
Interlocked operation ended 


LDII @COUNT, R1 ; Interlocked operation begun 
; Contents of COUNT > R1 

BZ CT ; If COUNT = 0, keep trying 

SUBI RI ; Decrement R1 (= COUNT) 

STII R1,@COUNT ; Update COUNT, XFO = 1 


Interlocked operation ended 


Figure 7-2 illustrates multiple ’C3x devices sharing global memory and using the 
interlocked instructions as in Example 7-10, Example 7-11, and Example 7-12. 
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Figure 7-2. Multiple TMS320C3xs Sharing Global Memory 


Global memory 


ADDR 
DATA 
CTRL 


Lock, count, or S 


Arbitration logic 


Local Local 
memory memory 


Sometimes it may be necessary for several processors to access some 
shared data or other common resources. The portion of code that must access 
the shared data is called a critical section. 


To ease the programming of critical sections, semaphores may be used. 
Semaphores are variables that can take only nonnegative integer values. Two 
primitive, indivisible operations are defined on semaphores (with S being a 


semaphore): 
v(S) S+178 
P(S): P: if (S == 0), go to P 


else S-175 S85 


Indivisibility of V(S) and P(S) means that when these processes access and 
modify the semaphore S; they are the only processes doing so. 


To enter a critical section, a P operation is performed on a common semaphore, 
say S (Sis initialized to 1). The first processor performing P(S) will be able to enter 
its critical section. All other processors are blocked because S has become 0. 
After leaving its critical section, the processor performs a V(S), thus allowing 
another processor to execute P(S) successfully. 
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The ’C3x code for V(S) is shown in Example 7-10; code for P(S) is shown in 
Example 7-11. Compare the code in Example 7-11 to the code in Example 7-9, 
which does not use semaphores. 


Example 7-10. Implementation of V(S) 


Vi BDIT @S,RO Interlocked read of S begins (XFO = 0) 
Contents of S — RO 
Increment RO (= S) 


Update S, end interlock (XFO = 0) 


ADDI 1,R0 
STII RO,@S 


Example 7-11. Implementation of P(S) 


P: OR 4,IOF ; End interlock (XFO = 1) 

NOP ; Avoid potential pipeline conflicts when 
; executing out of cache, on-chip memory 
; Or zero wait-state memory 

LDII @S,RO ; Interlocked read of S begins 
; Contents of S > RO 
; If S = 0, go to P and try again 
; Decrement RO (= S) 
; Update S, end interlock (XFO = 1) 


BZ P 
SUBI 1,R0 
STII RO,@S 


The SIGI operation can synchronize, at an instruction level, multiple ‘C3xs. 
Consider two processors connected as shown in Figure 7-3. The code for the 
two processors is shown in Example 7-12. 


Figure 7-3. Zero-Logic Interconnect of TMS320C3x Devices 
'C3x #1 'C3x #2 


Processor #1 runs until it executes the SIGI. It then waits until processor #2 
executes a SIGI. At this point, the two processors are synchronized and continue 
execution. 
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Example 7-12. Code to Synchronize Two TMS320C3x Devices at the Software Level 


Time Code for 'C3x #1 Code for ’C3x #2 
O e e 
o @ 
e e 
SIGI e 
@ 
e 
@ 
(WAIT) e 
@ 
 ) 
@ 


@® <——__ Synchronization occurs ————?#> SIGI 


fo, 


7.4.3 Pipeline Effects of Interlocked Instructions 


Before performing an interlocked instruction, the XFO pin must be configured as 
an output pin and the XF1 pin must be configured as an input pin through the 
IOF register (see subsection 3.1.10, //O Flag Register (IOF), on page 3-16). 
After the XFO and XF1 pins are configured, no interlocked instruction can occur 
in the following two instructions. 


Program Flow Control 7-19 


Interlocked Operations 


Example 7-13. Pipeline Delay of XF Pin Configuration 


Pipeline Operation 


PC | Fetch Decode | Read | Execute 
XFO set as an 
” LDI 2h, IOF output pin and 
XF1 set as an 
n+1 NOP LDI 2h, IOF input pin 
n+2 NOP NOP LDI 2h, IOF 
n+3 LDII *AR1, R1 NOP NOP LDI 2h, IOF 
n+4 LDII *AR1, R1 NOP NOP 
XF1 sampled _ 
n+5 LDII *AR1, R1 NOP 
n+6 LDII *AR1, R1 


XFO driven low 
and XF1 sampled 


STFI and STII instructions drive the XFO pin high during its execution phase. 
LDFI, LDII, and SIGI instructions sample the XF1 pin during its decode phase 
while driving the XFO pin low during its read phase. Therefore, do not use an 
LDFI, LDII, or SIGI instruction immediately after an STFI or STII instruction 
(see Example 7-14). 


Example 7-14. Incorrect Use of Interlocked Instructions 


Pipeline Operation 
PC | Fetch Decode | Read | Execute | 


n STFI R1, *AR1 


XFO pin 
n+1 LDFI *AR1, R2 STFI R1, *AR1 driven high 
XF1 pin -— 
sampled | 442 LDFI *AR1, R2 STFI R1, *AR1 


n+3 LDFI *AR1, R2 STFI R1, *AR1 


n+4 XFO pin LDFI *AR1, R2 
driven low 
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7.5 Reset Operation 


The ’C3x supports a nonmaskable external reset signal (RESET), which is 
used to perform system reset. This section discusses the reset operation. 


At start-up, the state of the ‘C3x processor is undefined. You can use the RESET 
signal to place the processor in a known state. This signal must be asserted low 
for ten or more H1 clock cycles to guarantee a system reset. H1 is an output 
clock signal generated by the ’C3x. (Check the datasheet for your device for the 
specific signal descriptions and electrical characteristics.) 


Reset affects the other pins on the device in either a synchronous or asynchro- 
nous manner. The synchronous reset is gated by the ’C3x’s internal clocks. The 
asynchronous reset directly affects the pins and is faster than the synchronous 
reset. Table 7-3 shows the state of the ’C3x’s pins after RESET = 0. Each pin is 
described according to whether the pin is reset synchronously or asynchronously. 


Table 7-3. TMS320C3x Pin Operation at Reset 


Signal 


D31—-D0 
A23—A0 

R/W 

IOSTRB 
STRBO_B3/A_; 
STRBO_B2/A_p 
STRBO_B1 
STRBO_BO 
STRB1_B3/A_1 
STRB1_B2/A_p 
STRB1_B1 
STRB1_BO 
STRB 

RDY 

HOLD 


Device 
‘C31/ 
Operation at Reset ‘C30 VC33 ‘C32 
Primary Bus Interface Signals 
Synchronous reset; placed in high-impedance state in a in 
Synchronous reset; placed in high-impedance state Val a in 
Synchronous reset; deasserted by going to a high level in Va in 
Synchronous reset; deasserted by going to a high level in in 
Synchronous reset; deasserted by going to a high level in 
Synchronous reset; deasserted by going to a high level a 
Synchronous reset; deasserted by going to a high level in 
Synchronous reset; deasserted by going to a high level in 
Synchronous reset; deasserted by going to a high level in 
Synchronous reset; deasserted by going to a high level al 
Synchronous reset; deasserted by going to a high level in 
Synchronous reset; deasserted by going to a high level in 
Synchronous reset; deasserted by going to a high level val Va 
Reset has no effect a in nw 
Reset has no effect a in a 
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Table 7-3. TMS320C3x Pin Operation at Reset (Continued) 


Signal 
HOLDA 
PRGW 


XD31-XDO 
XA12-XA0 
XR/W 
MSTRB 
XRDY 


RESET 

INT3-INTO 

IACK 

MC/MP or MCBL/MP 
SHZ 

XF1-XFO 


CLKXO 
DX0 
FSX0 
CLKRO 
DRO 
FSRO 


CLKX1 
DX1 
FSX1 
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Reset has no effect 

Reset has no effect 

Expansion Bus Interface 

Synchronous reset; placed in high-impedance state 
Synchronous reset; placed in high-impedance state 


Synchronous reset; placed in high-impedance state 


Synchronous reset; deasserted by going to a high level 


Reset has no effect 
Control Signals 
Reset input pin 


Reset has no effect 


Synchronous reset; deasserted by going to a high level 


Reset has no effect 

Reset has no effect 

Asynchronous reset; placed in high-impedance state 
Serial Port 0 Signals 

Asynchronous reset; placed in high-impedance state 
Asynchronous reset; placed in high-impedance state 
Asynchronous reset; placed in high-impedance state 
Asynchronous reset; placed in high-impedance state 
Asynchronous reset; placed in high-impedance state 
Asynchronous reset; placed in high-impedance state 
Serial Port 1 Signals 

Asynchronous reset; placed in high-impedance state 
Asynchronous reset; placed in high-impedance state 


Asynchronous reset; placed in high-impedance state 


‘C30 
lA 


Y ¥ YY XY YX Y ¥ ¥ ¥ ¥ 


Y ¥ ¥ YY YX 


X 


X 


Device 


‘C31/ 
VC33 


JA 


YX X FY YN 


YY ¥ ¥ XY ¥X 


‘C32 
iA 


al 


Y¥ XY ¥ ¥ XY XN 
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Table 7-3. TMS320C3x Pin Operation at Reset (Continued) 


Device 
‘C31/ 
Signal Operation at Reset ‘C30 VC33 ‘C32 
CLKR1 Asynchronous reset; placed in high-impedance state in 
DR1 Asynchronous reset; placed in high-impedance state val 
FSR1 Asynchronous reset; placed in high-impedance state in 
TimerO Signal 
TCLKO Asynchronous reset; placed in high-impedance state al in in 
Timer1 Signal 
TCLK1 Asynchronous reset; placed in high-impedance state in in in 
Supply and Oscillator Signals 
Vpp Reset has no effect a a a 
lIODVpp Reset has no effect a 
ADVpp Reset has no effect a 
PDVpp Reset has no effect al 
DDVpp Reset has no effect a 
MDVpp Reset has no effect a 
Vss Reset has no effect nw vw nw 
DVss Reset has no effect Va nw 
CVss Reset has no effect al al 
ss Reset has no effect Val nw 
VeBP Reset has no effect a nw 
VsuBs Reset has no effect in a nw 
X41 Reset has no effect a in 
X2/CLKIN Reset has no effect Val a a 
H1 Synchronous reset; will go to its initial state when RESET val Va in 
makes a 1 to 0 transition 
H3 Synchronous reset; will go to its initial state when RESET in a in 


makes a 1 to 0 transition 
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Table 7-3. TMS320C3x Pin Operation at Reset (Continued) 


Device 
‘C31/ 
Signal Operation at Reset ‘C30 VC33 ‘C32 
EMUO Undefined al al is 
EMU1 Undefined al al od 
EMU2 Undefined al al ad 
EMU3 Undefined al ad ad 
EMU4 Undefined al 
EMU5 Undefined a 
EMU6 Undefined in 
RSVO Undefined al 
RSV1 Undefined a 
RSV2 Undefined a 
RSV3 Undefined a 
RSV4 Undefined in 
RSV5 Undefined a 
RSV6 Undefined a 
RSV7 Undefined a 
RSV8 Undefined a 
RSV9 Undefined al 
RSV10 Undefined a 


7-24 


Reset Operation 


At system reset, the following additional operations are performed: 


LJ 


= 


Lj 


The peripherals are reset. This is a synchronous operation. Peripheral reset 
is described in Chapter 12, Peripherals. 


The external bus control registers are reset. The reset values of the control 
registers are described in Chapter 9, ‘C30 and ’C31/VC33 External- 
Memory Interface. 


The following CPU registers are loaded with 0: 


m ST (CPU status register), except in the ‘C32, the PRGW status bit field 
is loaded with the status of the PRGW pin 

m JE (CPU/DMA interrupt-enable flags) 

m IF (CPU interrupt flags) 

mg IOF (I/O flags) 


The reset vector is read from memory location Oh. On the ’C32, this is a 
32-bit data read. Once read, this value is loaded into the PC. This vector 
contains the start address of the system reset routine. 


At this point, code location is dictated by the PC. 


Multiple ’C3x devices, driven by the same system clock, may be reset and 
synchronized. When the 1 to 0 transition of RESET occurs, the processor is 
placed on a well-defined internal phase, and all of the ’C3x devices come up 
on the same internal phase and all internal memory locations. 


Unless otherwise specified, all registers are undefined after reset. 
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TMS320C30 and TMS320C31/VC33 Interrupt Vector Table 


The ’C3x supports multiple internal and external interrupts, which can be used for 
a variety of applications. Internal interrupts are generated by the DMA controller, 
timers, and serial ports. Four external maskable interrupt pins include INTO — 
INT3. Interrupts are automatically prioritized allowing interrupts to occur simulta- 
neously and serviced in a predefined order. This section discusses the operation 
of these interrupts. 


Additional information regarding internal interrupts can be found in Sec- 
tion 12.3.7, DMA and Interrupts, on page 12-65; Section 12.1.8, Timer Inter- 
rupts on page 12-13; and Section 12.2.11, Serial-Port Interrupt Sources, on 
page 12-34. 


Table 7-4 and Table 7-5 contain the interrupt vectors. In the microprocessor 
mode of the ‘C30 and the 'C31/VC33 (Table 7-4) and the microcomputer 
mode of the ’C31/VC33 (Table 7-5), the interrupt vectors contain the address- 
es of interrupt service routines that should start executing when an interrupt 
occurs. On the other hand, in the microcomputer/boot-loader mode of the 
°C31/VC33, the interrupt vector contains a branch instruction to the start of the 
interrupt service routine. 


Interrupts 


Table 7-4. Reset, Interrupt, and Trap-Vector Locations for the TMS320C30/ 


TMS320C31/VC33 Microprocessor Mode 


Address 
00h 
Oth 
02h 
03h 
04h 
05h 


06h 


07h 


08h 


09h 
OAh 


3Bh 


3Ch 
3Dh 
3Eh 
3Fh 


Name 
RESET 
INTO 
INT1 
INT2 
INT3 
XINTO 


RINTO 
XINT1t 
RINT1Tt 


TINTO 
TINT1 
DINT 
Reserved 
e 

e 

e 
Reserved 
TRAP 0 


TRAP 27 


TRAP 28 (reserved 
TRAP 29 (reserved 
TRAP 30 ( 

TRAP 31 ( 


reserved 


) 
) 
) 
) 


reserved 


t Reserved on ’C31/VC33 


Function 

External reset signal input 
External interrupt on the INTO pin 
External interrupt on the INTT pin 
External interrupt on the INT2 pin 
External interrupt on the INT3 pin 


Internal interrupt generated when serial port 
0 transmit buffer is empty 


Internal interrupt generated when serial port 
0 transmit buffer is full 


Internal interrupt generated when serial port 
1 transmit buffer is empty 


Internal interrupt generated when serial port 
1 transmit buffer is full 


Internal interrupt generated by timerO 
Internal interrupt generated by timer1 


Internal interrupt generated by DMA controller 


Internal interrupt generated by TRAP 0 
instruction 


Internal interrupt generated by TRAP 27 
instruction 
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Table 7-5. Reset, Interrupt, and Trap-Branch Locations for the TMS320C31/VC33 fost 
Microcomputer Boot Mode 


Address Name Function 

809FC1 INTO External reset signal input 

809FC2 INT4 External interrupt on the INTO pin 

809FC3 INT2 External interrupt on the INT1 pin 

809FC4 INT3 External interrupt on the INT2 pin 

809FC5 XINTO External interrupt on the INT3 pin 

809FC6 RINTO Internal interrupt generated when 
serial port 0 transmit buffer is 
empty 

809FC7 XINT1 (Reserved) 

809FC8 RINT1 (Reserved) 

809FC9 TINTO Internal interrupt generated by 
timerO 

809FCA TINT1 Internal interrupt generated by 
timer 

809FCB DINT Internal interrupt generated by DMA 
controller 


809FCC-809FDF Reserved 


809FE0 TRAPO Internal interrupt generated by 
TRAP 0 instruction 
809FE1 TRAP1 Internal interrupt generated by 
TRAP 1 instruction 
e e 
e e 
e e 
809FFB TRAP27 Internal interrupt generated by 


TRAP 27 instruction 


809FFC-809FFF Reserved 
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7.6.2 TMS320C32 Interrupt Vector Table cowl 


Similarly to the rest of the ’C3x device family, the ‘C32’s reset vector location 
remains at address 0. On the other hand, the interrupt and trap vectors are 
relocatable. This is achieved by a new bit field in the CPU interrupt flag register 
called the interrupt-trap table pointer (ITTP), shown in Figure 3-11 on page 
3-15. The ITTP bit field dictates the starting location (base) of the interrupt- 
trap-vector table. This base address is formed by left-shifting the value of the 
ITTP bit field by eight bits. This shifted value is called the effective base address 
and is referenced as EA[ITTP], as shown in Figure 7-4. Therefore, the location 
of an interrupt or trap vector is given by the addition of the effective base address 
formed by the ITTP bit field (EA[ITTP]) and the offset of the interrupt or trap 
vector in the interrupt-trap-vector table, as shown in Table 7-6. For example, 
if the ITTP contains the value 100h, the serial-port transmit interrupt vector will 
be located at 10005h. Note that the vectors stored in the interrupt-trap-vector 
table are the addresses of the start of the respective interrupt and trap routines. 
Furthermore, the interrupt-trap-vector table must lie on a 256-word boundary, 
since the eight LSBs of the effective base address of the interrupt-trap-vector 
table are 0. 


Figure 7-4. Effective Base Address of the Interrupt-Trap-Vector Table 


23 8 7 0 


EA[ITTP] = Bits 31-16 of the CPU interrupt flag register 00000000 
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Table 7-6. Interrupt and Trap-Vector Locations for the TMS320C32 cow! 
Address Name Function 
EA[ITTP]+00h—s«éReserved 
EA[ITTP] + 01h INTO External interrupt on the INTO pin 
EA[ITTP] + 02h INT1 External interrupt on the INT1 pin 
EA[ITTP] + 03h INT2 External interrupt on the INT2 pin 
EA[ITTP] + 04h INT3 External interrupt on the INT3 pin 
EA[ITTP] + 05h XINTO Internal interrupt generated when serial port 0 transmit buffer 
is empty 
EA[ITTP] + 06h RINTO Internal interrupt generated when serial port 0 transmit buffer 
is full 
EA[ITTP] + 07h Reserved 
EA[ITTP] + 08h Reserved 
EA[ITTP] + 09h TINTO Internal interrupt generated by timerO 
EA[ITTP] + OAh TINT1 Internal interrupt generated by timer1 
EA[ITTP] + OBh DINTO Internal interrupt generated by DMA channel 0 
EA[ITTP] + O0Ch DINT1 Internal interrupt generated by DMA channel 1 
EA[ITTP] + ODh Reserved 
EA[ITTP] + 1Fh Reserved 
EA[ITTP] + 20h TRAP 0 Internal interrupt generated by TRAP 0 instruction 
e 
° 
° 
EA[ITTP] + 3Bh TRAP 27 Internal interrupt generated by TRAP 27 instruction 
EA[ITTP] + 3Ch TRAP 28 (reserved) 
EA[ITTP] + 3Dh TRAP 29 (reserved) 
EA[ITTP] + 3Eh TRAP 30 (reserved) 
EA[ITTP] + 3Fh TRAP 31 (reserved) 
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7.6.3 Interrupt Prioritization 


When two interrupts occur in the same clock cycle or when two previously 
received interrupts are waiting to be serviced, one interrupt is serviced before 
the other. The CPU handles this prioritization by servicing the interrupt with the 
least priority. The priority of interrupts is handled by the CPU according to the 
interrupt vector table. Priority is set according to position in the table—those with 
displacements closest to the base address of the table are higher in priority. 
Table 7-7 shows the priorities assigned to the reset and interrupt vectors. 


Table 7-7. Reset and Interrupt Vector Priorities 


Reset or 
Interrupt 


RESET 
INTO 
INT1 
INT2 
INT3 
XINTO 
RINTO 
XINT1t 
RINT1t 
TINTO 
TINT! 


DINT/ 
DINTO 


DINT14 


Vector 

Location Priority Function 

Oh 0 External reset signal input on the RESET pin 

th | External interrupt on the INTO pin 

2h 2 External interrupt on the INT1 pin 

3h 3 External interrupt on the INT2 pin 

4h 4 External interrupt on the INT3 pin 

5h 5 Internal interrupt generated when serial-port 0 transmit buffer is empty 
6h 6 Internal interrupt generated when serial-port 0 receive buffer is full 

7h 7 Internal interrupt generated when serial-port 1 transmit buffer is empty 
8h 8 Internal interrupt generated when serial-port 1 receive buffer is full 

9h 9 Internal interrupt generated by timerO 

OAh 10 Internal interrupt generated by timer1 

OBh 11 Internal interrupt generated by DMA channel 0 

OCh 12 Internal interrupt generated by DMA channel 1 


t Reserved on ’C31/VC33 and ’C32 
+ Present on ‘C32 only 
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7.6.4 CPU Interrupt Control Bits 


Three CPU registers contain bits that control interrupt operation: 


Lj] Status (ST) register 


The CPU global interrupt-enable bit (GIE) located in the CPU status register 
(ST) controls all maskable CPU interrupts. When this bit is set to 1, the CPU 
responds to an enabled interrupt. When this bit is cleared to 0, all CPU inter- 
rupts are disabled. see Section 3.1.7 on page 3-5 for more information. 


Lj CPU/DMA interrupt-enable (IE) register 


This register individually enables/disables CPU, DMA external, serial port, 
and timer interrupts. See Section 3.1.8 on page 3-9 for more information. 


Lj) CPU interrupt flag (IF) register 


This register contains interrupt flag bits that indicate the corresponding 
interrupt is set. See Section 3.1.9 on page 3-11 for more information. 


7.6.5 Interrupt Flag Register Behavior 
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When an external interrupt occurs, the corresponding bit of the IF register is set 
to 1. When the CPU or DMA controller processes this interrupt, the corresponding 
interrupt flag bit is cleared by the internal interrupt acknowledge signal. However, 
for level-triggered interrupts, if INTn is still low when the interrupt acknowledge 
signal occurs, the interrupt flag bit is cleared for only one cycle and then set again, 
because INTn is still low. Depending on when the IF register is read, it is also 
possible that this bit may be 0 even though INTn is 0. When the ’C3x is reset, 
0 is written to the interrupt flag register, clearing all pending interrupts. 


The interrupt flag register bits can be read from and written to under software 
control. Writing a 1 to an IF register bit sets the associated interrupt flag to 1. 
Similarly, writing a 0 resets the corresponding interrupt flag to 0. In this way, all 
interrupts may be triggered and/or cleared through software. Since the interrupt 
flags may be read, the interrupt pins may be polled in software when an inter- 
rupt-driven interface is not required. 


Internal interrupts operate in a similar manner. In the IF register, the bit corre- 
sponding to an internal interrupt can be read from and written to through software. 
Writing a 1 sets the interrupt latch; writing a 0 clears it. All internal interrupts are 
one H1/H3 cycle in length. If any previous bit value of the IF register needs to be 
preserved, a modification to IF register should be performed with logic operations 
(AND, OR, etc.) directly to IF. 
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Figure 7-5. IF Register Modification 


Correct Incorrect 

LDI @MASK, RO LDI IF, R1 

AND RO, IF AND @MASK, R1 
LDI R1, IF 


Note: IF Register Load Priority 


If a load of the IF register occurs simultaneously with a set or reset of a flag 
by an interrupt pulse, the loading of the flag has higher priority and overwrites 
the IF register value. 


eee ss) 


7.6.6 Interrupt Processing 


The ’C8x allows the CPU and DMA coprocessor to respond to and process 
interrupts in parallel. Figure 7-6 on page 7-34 shows interrupt processing 
flow; for the exact sequence, see Table 7-8 on page 7-36. 


For a CPU interrupt to occur, at least two conditions must be met: 


.) Allinterrupts must be enabled globally by setting the GIE bit to 0 in the status 
register. 


_) The interrupt must be enabled by setting the corresponding bit in the 
IF register. 


In the CPU interrupt processing cycle (left side of Figure 7-6), the corresponding 
interrupt flag in the IF register is cleared, and interrupts are globally disabled 
(GIE =0). The CPU completes all fetched instructions. The current PC is pushed 
to the top of the stack. The interrupt vector is then fetched and loaded into the 
PC, and the CPU starts executing the first instruction in the interrupt service 
routine (ISR). 
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Figure 7-6. CPU Interrupt Processing 


7-34 


Is an enabled 
interrupt set 


If enabled, If enabled, 
interrupt is interrupt is 
a CPU interrupt a DMA interrupt 


Clear interrupt flag 
DMA proceeds according 
to SYNC bits 
DMA continues 


Disable interrupts 


Clear interrupt flag 


| 


PC — *(++SP) 


Complete all fetched instructions 


PC < interrupt vector 


CPU starts executing ISR routine 


Wo ae 


Note: CPU and DMA Inierrupts 


CPU interrupts are acknowledged (responded to by the CPU) on instruction 
fetch boundaries only. If instruction fetches are halted because of pipeline 
conflicts or execution of RPTS loops, CPU interrupts are not acknowledged 


until the next instruction fetch. 
| aa es || 
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If you wish to make the interrupt service routine interruptible, you can set the 
GIE bit to 1 after entering the ISR. 


The interrupt acknowledge (IACK) instruction can be used to signal externally that 
an interrupt has been serviced. If external memory is specified in the operand, 
IACK drives the IACK pin and performs a dummy read. The read is performed 
from the address specified by the IACK instruction operand. IACK is typically 
placed in the early portion of an ISR. However, depending on your application, 
it may be better to place it at the end of the ISR or not at all. 


Note the following: 


[J Interrupts are disabled during an RPTS and during a delayed branch (until 
the three instructions following a delayed branch are completed). Interrupts 
are held until after the branch. 


(J When an interrupt occurs, instructions currently in the decode and read 
phases continue regular execution, unlike an instruction in the fetch 
phase: 


If the interrupt occurs in the first cycle of the fetch of an instruction, the 
fetched instruction is discarded (not executed), and the address of 
that instruction is pushed to the top of the system stack. 


If the interrupt occurs after first cycle of the fetch (in the case of a multi- 
cycle fetch due to wait states), that instruction is executed, and the 
address of the next instruction to be fetched is pushed to the top of 
the system stack. 


7.6.7 CPU Interrupt Latency 


CPU interrupt latency, defined as the time from the acknowledgement of the 
interrupt to the execution of the first ISR instruction, is at least eight cycles. This 
is explained in Table 7-8 on page 7-36, where the interrupt is treated as an 
instruction. It assumed that all of the instructions are single-cycle instructions. 
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Table 7-8. Interrupt Latency 


Cycle 
1 


Description 


Fetch 


Decode 


Read 


Execute 


Recognize interrupt in single-cycle fetched 


oOo Nn Oo oa FF WwW ND 


(prog a + 1) instruction 

Clear GIE bit. Clear interrupt flag 

Read the interrupt vector table 

Store return address to stack 

Pipeline begins to fill with ISR instruction 
Pipeline continues to fill with ISR instruction 


Pipeline continues to fill with ISR instruction 


proga+1 


prog a 


interrupt 


prog a-1 


prog a 
interrupt 


prog a—2 


prog a-1 
prog a 
interrupt 


isr1 


Execute first instruction of interrupt service routine 


7.6.8 External Interrupts 
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The four external maskable interrupt pins INTO-INT3 are enabled at the IF reg- 
ister (Section 3.1.9, CPU Interrupt Flag (IF) Register, on page 3-11) and are syn- 
chronized internally. They are sampled on the falling edge of H1 and passed 
through a series of H1/H3 latches internally. These latches require the interrupt 
signal to be held low for at least one H1/H3 clock cycle to be recognized by the 
‘C3x. Once synchronized, the interrupt input sets the corresponding interrupt 


flag register (IF) bit if the interrupt is active. 


Figure 7-7 shows a functional diagram of the logic used to implement external 


interrupt inputs. 
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Figure 7-7. Interrupt Logic Functional Diagram 


Internal interrupt 
set signal EINTn(CPU) 


Interrupt  GIE(CPU) 
flag (n) 


INTn = ) > Internal To 
DQ DQ DQ interrupt control 
rocessor section 
| CLK | CLK | CLK ° 


Internal interrupt 
clear/acknowledge 
signal 


H1 H3 H1 
EINTn(DMA) 


These interrupts are prioritized by the selection of one over the other if both 
come on the same clock cycle (INTO the highest, INT1 next, etc.). When an 
interrupt is taken, the status register ST(GIE) bit is reset to 0, disabling any other 
incoming interrupt. This prevents any other interrupt (INTO — INT) from assuming 
program control until the ST(GIE) bit is set back to 1. On a return from an interrupt 
routine, the RET| and RETIcond instructions set the ST(GIE) bit to 1. 


On the ‘C30 and ‘C31/VC33, external interrupts are level triggered. On the 
‘C32, external interrupts are edge or level triggered, depending on the INT 
config bit field of the status register. 


For an edge-triggered interrupt to be detected by the ’C32 the external pin 
must transition from 1 to 0. And then, it needs to be held low for at least one 
H1/H3 cycle (but it could be held low longer). 


For a level-triggered interrupt to be detected by the ’C3x, the external pin must 
be held low for between one and two cycles (1 < low-pulse width < 2). If the 
interrupt is held low for more than two cycles, more than one interrupt might 
be recognized. There is no need to provide an edge in this case. 
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7.7 DMA Interrupts 


Interrupts can also trigger DMA read and write operations. This is called 
DMA synchronization. The DMA interrupt processing cycle is similar to that of 
the CPU. After the pertinent interrupt flag is cleared, the DMA coprocessor 
proceeds according to the status of the SYNC bits in the DMA coprocessor 
global-control register. 


If the interrupt in the CPU/DMA interrupt-enable (IE) register is enabled, the 
interrupt controller automatically latches the interrupt and saves it for future 
DMA use. The interrupt controller latches the interrupt, clears the flag in the IF 
register, and informs the data that an interrupt has occurred. The DMA then pro- 
ceeds with the transfer according to the previously configured CPU/DMA prior- 
ity. Even if the DMA has not been started, the interrupt latch occurs, and the flag 
is cleared, except when the start bits in the DMA control register have the reset 
value 009 in START bits. DMA reset clears the interrupt internal latch. 


7.7.1. DMA Interrupt Control Bits 
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Two registers contain bits used to control DMA interrupt operation: 


Lj CPU/DMA interrupt-enable register (IE). All DMA interrupts are controlled 
by the most significant 16 bits in the IE register and by the SYNC bits of 
the DMA channel control registers (see Section 12.3.3, DMA Registers, 
on page 12-51). The DMA interrupts are not dependent upon ST(GIE) and 
are local to the DMA. 


(1 The DMA channel control register. Each DMA coprocessor channel uses 
a channel control register to determine its mode of operation. This register 
is shown in Section 12.3.3. 


The IE is broken into several subfields that determine which interrupts can be 
used to control the synchronization for each DMA channel. For example, the bits 
in each of these fields allow you to select whether a DMA channel is synchro- 
nized to a port, a timer, or an external interrupt pin. Note that the ‘C32 has two 
DMA channels while the ‘C30 and ’C31/VC33 have a single DMA channel. 


See Section 3.1.8, CPU/DMA Interrupt-Enable Register (IE), on page 3-9, for 
a description of the IE. 
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7.7.2 DMA Interrupt Processing 


Figure 7-8 shows the general flow of interrupt processing by the DMA 
coprocessor. 


Figure 7-8. DMA Interrupt Processing 


Is an enabled 
interrupt set 


If enabled in the IE 
register, the interrupt Is 
a DMA interrupt 


Clear interrupt flag 


DMA proceeds according 
to DMA control register 
SYNC bits 


DMA continues 


For more information about DMA interrupts, see Section 12.3.7, DMA Interrupts 
on page 12-65. 
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7.7.3  CPU/DMA Interaction 


If the DMA is not using interrupts for synchronization of transfers, it is not 
affected by the processing of the CPU interrupts. Detected interrupts are 
responded to by the CPU and DMA on instruction fetch boundaries only. 
Since instruction fetches are halted due to pipeline conflicts or when executing 
instructions in an RPTS loop, interrupts are not responded to until instruction 
fetching continues. It is therefore possible to interrupt the CPU and DMA simul- 
taneously with the same or different interrupts and, in effect, synchronize their 
activities. For example, it may be necessary to cause a high-priority DMA 
transfer that avoids bus conflicts with the CPU (that is, a transfer that makes 
the DMA higher priority than the CPU). This may be accomplished by using 
an interrupt that causes the CPU to trap to an interrupt routine that contains 
an IDLE instruction. Then, if the same interrupt is used to synchronize DMA 
transfers, the DMA transfer counter can be used to generate an interrupt and 
thus return control to the CPU following the DMA transfer. 


Since the DMA and CPU share the same set of interrupt flags, the DMA may 
clear an interrupt flag before the CPU can respond to it. For example, if the 
CPU interrupts are disabled, the DMA can respond to interrupts and thus clear 
the associated interrupt flags. Figure 7-9 shows the sequence of events in the 
interrupt processing for both the CPU and DMA controllers. 


Figure 7-9. Parallel CPU and DMA Interrupt Processing 
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Interrupt 


CPU | DMA coprocessor 


Does 


No GIE=1 and Is the 
is the interrupt interrupt enabled 
enabled in the in the IE 
IE register? register? 


Process the CPU interrupt Process the DMA interrupt 
as shown in Figure 7-6. as shown in Figure 7-8. 
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7.7.4 TMS320C3x Interrupt Considerations 


Give careful consideration to ’C3x interrupts, especially if you make modifications 
to the status register when the global interrupt-enable (GIE) bit is set. This can 
result in the GIE bit being erroneously set or reset as described in the following 
paragraphs. 


The GIE bit field of the status register is set to 0 (reset) by an interrupt. If a load 
of the status register occurs simultaneously with a CPU interrupt pulse trying 
to reset GIE, GIE will be reset. 


Also, resetting GIE by an interrupt or TRAP instruction can cause a processing 
error if any code, following within two cycles of the interrupt recognition, attempts 
to read or modify the status register. For example, if the status register is being 
pushed onto the stack, it will be stored incorrectly if an interrupt was acknowledged 
two cycles before the store instruction. 


When an interrupt signal is recognized, the ‘C3x continues executing the 
instructions already in the read and decode phases in the pipeline. However, 
because the interrupt is acknowledged, the GIE bit is reset to 0, and the store 
instruction already in the pipeline will store the wrong status register value. 


For example, if the program is like this: 
NOP 
interrupt recognized -->LDI @V_ADDR,AR1 
MPYI *AR1, RO 
PUSH ST 
POP ST 
the PUSH ST instruction will save the ST contents in memory, which includes 


GIE = 0. Since the device is expected to have GIE = 1, the POP ST instruction 
will put the wrong status register value into the ST (see Table 7-9). 
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Table 7-9. Pipeline Operation with PUSH ST 


Cycle 


1 


oN OO FF WO PY 


Description 


Read location V_-ADDR 

Load AR1; recognize interrupt 

Clear GIE bit; clear interrupt flag; read SP 
Read interrupt vector table; save ST in stack 


Store return address on stack 


Fetch Decode 

NOP 

LDI NOP 

MPYI LDI 

PUSH MPYI 

- PUSH 
Interrupt 


Read 


NOP 
LDI 
MPYI 
PUSH 


Interrupt 


Execute 


NOP 
LDI 
MPYI 
PUSH 


Interrupt 


The following example shows setting the GIE bit by a load instruction that is 
immediately followed by an interrupt: 


Pe >; GIE = 1 

LDI 02000h, ST; GIE = 0 
interrupt recognized -->MPYI *AR1, RO; 

ADD *ARO, R1 


In this example, the load of the status register or interrupt-flag register overwrites 
the reset of the GIE bit by the interrupt (see Table 7-10). 


Table 7-10. Pipeline Operation with Load Followed by Interrupt 


Cycle Description Fetch Decode Read Execute 
1 LDI 
2 Interrupt recognized - LDI 
3 Interrupt resets GIE bit, clears interrupt flag, interrupt LDI 
reads SP 
4 GIE set by load instruction; interrupt vector table interrupt LDI 
read and ST saved on stack 
5 Store return address on stack interrupt 
6 Fetch first instruction of ISR with GIE = 1 ISR 


A similar situation may occur if the GIE bit = 1 and an instruction executes that 
is intended to modify the other status bits and leave the GIE bit set. In the above 
example, this erroneous setting would occur if the interrupt were recognized two 
cycles before the POP ST instruction. In that case, the interrupt would clear the 
GIE bit, but the execution of the POP instruction would set the GIE bit. Since the 
interrupt has been recognized, the interrupt service routine will be entered with 
interrupts enabled, rather than disabled as expected. 
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One solution is to use an instruction that is uninterruptible such as RPTS as 
follows to set the GIE: 


RPTS 0 
AND 2000h, ST ; Set GIE=1 


Use the following to reset the GIE: 


RPTS 0 
AND ODFFFh, ST ; Set GIE=0 


Another alternative incorporates the following code fragment, which protects 
against modifying or saving the status register by disabling interrupts through 
the interrupt-enable register: 


PUSH IE ; Save IE register  e Added instructions to 
LDI 0, IE Clear IE register avoid pipeline problems 


NOP e 2 NOPs or useful instructions 
NOP 
AND ODFFFh, ST Set GIE = 0 e Instruction that reads or 


POP IE ‘ writes to ST register. 

i Added instruction 

: to avoid pipeline 

z problems. 
In summary, the next three instructions immediately following an instruction 
that clears the GIE bit might be interrupted. Also, the next three instructions 
immediately following an instruction that sets the GIE bit might not be interrupted 
even if there is a pending interrupt (see Example 7-15). Similarly, the next three 
instructions immediately following an instruction that clears an interrupt-enable 
mask might be interrupted. Furthermore, the next three instructions immediately 
following an instruction that sets an interrupt flag might be executed before the 
interrupt occurs. 


Example 7-15. Pending Interrupt 


LDI Oh, ST ; set GIE = 0 

LDI lh, R1 

LDI 2h, R2 

MPYI *AR1, RO ; interrupts still enabled 
ADDI *AR1,RI1 ; interrupts disabled here 
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7.7.5 TMS320C30 Interrupt Considerations 
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The ’C30 silicon revisions earlier than 4.0 have two unique exceptions to the 
interrupt operation. This does not apply to ’C30 silicon revision 4.0 or greater, 
any 'C31/VC33 silicon, or any ’C32 silicon. 


On ’C30 silicon revisions earlier than 4.0: 


1 The status register global interrupt-enable (GIE) bit may be erroneously 
reset to 0 (disabled setting) if all of the following conditions are true: 


H@ Aconditional trap instruction (TRAPcona) has been fetched 
Mm The condition for the trap is false. 


m A pipeline conflict has occurred, resulting in a delay in the decode or 
read phase of the instruction. 


During the decode phase of a conditional trap, interrupts are temporarily 
disabled to ensure that the trap executes before a subsequent interrupt. Ifa 
pipeline conflict occurs and causes a delay in execution of the conditional 
trap, the interrupt disabled condition may become the last known condition of 
the GIE bit. If the trap condition is false, interrupts are permanently disabled 
until the GIE bit is intentionally set. The condition is not present when the trap 
condition is true, because normal operation of the instruction causes the GIE 
to be reset, and standard coding practice sets the GIE to 1 before the trap 
routine is exited. Several instruction sequences that cause pipeline conflicts 
have been found: 


HM LI mem, SP 
TRAPcond n 

HM .LpDI mem, SP 
NOP 
TRAPcond n 

HM sTI SP,mem 
TRAPcond n 

M sTI Rx, *ARy 
LDI *ARx, RY 
| |LDI *ARz, Rw 
TRAPcond n 


Other similar conditions may also cause a delay in the execution. The 
following solution is recommended to avoid or rectify the problem: 
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Insert two NOP instructions immediately before the TRAP cond instruction. 
One NOP is insufficient in some cases, as illustrated in the second bulleted 
item, above. This eliminates the opportunity for any pipeline conflicts in the 
immediately preceding instructions and enables the conditional trap 
instruction to execute without delays. 


Asynchronous accesses to the interrupt flag register (IF) can cause the 
C30 silicon revision prior to 4.0 to fail to recognize and service an inter- 
rupt. This may occur when an interrupt is generated and is ready to be 
latched into the IF register on the same cycle that the IF is being written 
to by the CPU. Note that logic operations (AND, OR, XOR) may write to 
the IF register. 


The logic of ’C30 silicon revision earlier than 4.0 currently gives the CPU 
write priority; consequently, the asserted interrupt might be lost. This is 
true if the asserted interrupt was generated internally (for example, a direct 
memory access (DMA) interrupt). This situation arises as a result of a deci- 
sion to poll certain interrupts or a desire to clear pending interrupts due to a 
long pulse width. In the case of a long pulse width, the interrupt may be 
generated after the CPU responds to the interrupt and attempts to auto- 
matically clear it by the interrupt vector process. 


The recommended solution is to avoid using the interrupt polling technique, 
and to design the external interrupt inputs to have pulse widths between 
1 and 2 instruction cycles. The alternative to strict polling is to periodically 
enable and disable the interrupts that would be polled, allowing the normal 
interrupt vectoring to take place; this automatically clears the interrupt flag 
without affecting other interrupts. If you must clear a pending interrupt, you 
should use a memory location to indicate that the interrupt is invalid. The 
interrupt service routine can read that location, clear it (if the pending inter- 
rupt is invalid), and return immediately. The following code fragments 
show how to handle a dummy interrupt due to a long interrupt pulse: 
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ISR_n: 


ISR_n START: 


ISR_n_END: 
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PUSH 
PUSH 
PUSH 
LDI 
LDI 
BNN 
STI 
POP 
POP 
POP 
RETI 


LDI 
AND 
BNZ 
LDI 
LDI 
STI 


POP 
POP 
POP 
RETI 


ST 
DP 

RO 

0, DP 

@DUMMY_ INT, RO 
ISR_n_START 
DP, @DUMMY_INT 
RO 

DP 

ST 


INT Fn, RO 


IF, RO 
ISR_n_END 
0, DP 
OFFFFh, RO 


RO, @DUMMY INT 


RO 
DP 
ST 


Save registers 


Clear Data-page Pointer 

If DUMMY_INT is O or positive, 
go to ISR_n_ START 

Set DUMMY INT = 0 


Housekeeping, return from interrupt 


Normal interrupt service routine 
Code goes here 


If ones in IF reg match 
INT Fn, exit ISR 
Otherwise clear 

DP and set 

DUMMY _INT negative & exit 


Exit ISR 


Traps 


7.8 Traps 


A trap is the equivalent of a software-triggered interrupt. In the ’C3x, traps and 
interrupts are treated identically, except in the way in which they are triggered. 


7.8.1 Initialization of Traps and Interrupts 
Traps and interrupts are triggered differently in the ’C3x: 


_j Traps are always triggered by a software mechanism, by the TRAPcond 
(conditional trap) instructions. 


_j Interrupts are always triggered by hardware events (for example, by exter- 
nal interrupts, DMA interrupts, or serial-port interrupts). 


The GIE bit in the ST register and the mask bits in the IE do not apply to traps. 
7.8.2 Operation of Traps 

Figure 7-10 shows the general flow of traps which is similar to interrupts. 
Figure 7-10. Flow of Traps 


Trap executed 
(TRAP cond) 


Trap or interrupt service routine 


Return executed 
(RETIcona) 


° otic 
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The RETIcond instruction manipulates the status flags as shown in block (3) 
in Figure 7-10. RETlcond provides a return from a trap or interrupt. 


The ’C3x supports 32 different traps. When a TRAPcond n instruction is 
executed, the ’C3x jumps to the address stored in the memory location pointed 
to by the corresponding trap-vector table pointer. The location of the trap-vector 
table is shown in Table 7-4 on page 7-27 (’C30/’C31/VC33 microprocessor 
mode), Table 7-5 ('C31/VC33 microcomputer boot mode) on page 7-28, and 
Table 7-6 on page 7-30 for the ’C32. 
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7.9 Power Management Modes 


The following ’C3x devices have been enhanced by the addition of two power- 
down modes: IDLE2 and LOPOWER: 


OHOUCUU 


C30 silicon version 7.0 or greater 
*LC31 

C31 silicon revision 5.0 or greater 
C32 

"VC33 


7.9.1 IDLE2 Power-Down Mode 


The H1 instruction clock is held high until one of the four external interrupts is 
asserted. In IDLE2 mode, the ‘C3x devices supporting these modes behave as 
follows: 


_] 
L) 


No instructions are executed. 
The CPU, peripherals, and internal memory retain their previous states. 


The external bus output pins are idle: 

Mm The address lines remain in their previous states. 
mM The data lines are in the high-impedance state. 

@ The output control signals are in their inactive state. 
Oo 


If a multicycle read or write does not preceed the IDLE2 opcode, that 
access will be forzen onto the bus until IDLE2 is exited. This can be 
advantageous for low power applications since the bus is frozen in an 
active state. That is, the device pins are not floating, and therefore do 
not require pullup or pulldowns. 


When the device is in the functional (nonemulation) mode, the clocks stop 
with H1 high and H3 low (see Figure 7-11). 


The devices remain in IDLE2 until one of the four external interrupts 
(INT3-INTO) is asserted for at least one H1 cycle. When one of the four 
interrupts is asserted, the clocks start after a delay of one H1 cycle. When 
the clocks restart, they may be in the opposite phase (that is, H1 may be 
high if H3 was high before the clocks were stopped; H3 may be low if H1 
was previously low). The H1 and H3 clocks remain 180 degrees out of 
phase with each other (see Figure 7-12). 


During IDLE2 operations, the CPU recognizes one of the four external 
interrupts if it is asserted for more than one H1 cycle. To avoid generating 
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multiple false interrupts in level-triggered mode, the interrupt must be 
asserted for fewer than three H1 cycles. 


(1 The interrupt service routine (ISR) must have been set up before placing 
the device in IDLE2 mode, because the instruction following the IDLE2 
instruction is not executed until the RETI (return from interrupt) instruction 
is executed. 


(J When the device is in emulation mode, the H1 and H3 clocks continue to 
run normally and the CPU operates as if an IDLE instruction was 
executed. The clocks continue to run for correct operation of the emulator. 


Delayed Branch 


For correct device operation, the three instructions following a 


delayed branch should not include either IDLE or IDLE2 instructions. 


Figure 7-11.IDLE2 Timing 


Idle 2 execution 


wf NF NN 
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Figure 7-12. Interrupt Response Timing After IDLE2 Operation 


Clocks driven | Interrupt vector Fetch first 
read instruction of 
service routing 


oN NA NSNSKSINS NSAI RIANA VLA 
A) EN ) NG), 
fo “HY Nf Vf NZJ 
INT3 to Tf 
INTO 
INT3 to rr ry a rn | ae 
INTO Flag 
ADDR Vector address ist address 


7.9.2 LOPOWER 


In the LOPOWER (low-power) mode, the CPU continues to execute instructions, 
and the DMA can continue to perform transfers, but at a reduced clock rate of 
CLKIN frequency divided by 16. 


A ’C31/VC33 with a CLKIN frequency of 32 MHz performs identically to a 
2 MHz ’C31/VC33 with an instruction cycle time of 1,000 ns. 


During the read phase of the... The ’C31/VC33 and ’C32... 
LOPOWER instruction (Figure 7-13) Slow to 1/16 of full-speed operation. 


MAXSPEED instruction (Figure 7-14) Resume full-speed operation. 
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Figure 7-13. LOPOWER Timing 
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LOPOWER read 


le 32 CLKIN & 


Figure 7-14. MAXSPEED Timing 
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MAXSPEED read 
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| 
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Pipeline Operation 


Two characteristics of the’C3x that contribute to its high performance are: 


_j Pipelining 
(4 Concurrent I/O and CPU operation 


The following four functional units control ’C3x operation: 


_j Fetch 
_} Decode 
Lj Read 
_j} Execute 


Pipelining is the overlapping or parallel operations of the fetch, decode, read, 
and execute levels of a basic instruction. 


The DMA controller decreases pipeline interference and enhances the CPU’s 
computational throughput by performing input/output operations. 
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8.1 Pipeline Structure 


Figure 8-1. TMS320C3x Pipeline Structure 
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The following list describes the four major units of the ‘C3x pipeline structure and 


their functions: 


Fetch unit (F) 


Decode unit (D) 


Read unit (R) 


Execute unit (E) 


Fetches the instruction words from memory 
and updates the program counter (PC). 


Decodes ithe instruction word and performs 
address generation. Also, the decode unit controls 
modification of the ARn registers in the indirect 
addressing mode and of the stack pointer when 
PUSH to/POP from the stack occurs. 


If required, reads the operands from memory. 


If required, reads the operands from the register 
file, performs the necessary operation, and writes 
results to the register file. If required, results of 
previous operations are written to memory. 


All instruction executions perform these four basic functions: fetch, decode, read, 
and execute. Figure 8-1 illustrates these four levels of the pipeline structure. The 
levels are indexed according to instruction and execution cycle. In the figure, per- 
fect overlap in the pipeline, where all four units operate in parallel, occurs at cycle 
(m). Levels about to be executed are at m +1, and those just previously executed 
are at m—1. The ‘C3x pipeline controller supports a high-speed processing rate 
of one execution per cycle. It also manages pipeline conflicts so that they are 
transparent to you. You do not need to take any special precautions to ensure 
correct operation. 


CYCLE 


m-3 


m-2 


m+3 


Fetch 


N K M & 


| Decode Read Execute 
WwW — — 
x W = 
Perfect 
Y x W overlap 
zZ Y x 
= Z Y 
= = Z 


Note: W, X, Y, Z = Instruction representations 
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For ‘C30 and ‘C31/VC33, priorities from highest to lowest have been assigned 
to each of the functional units of the pipeline and to the DMA controller as fol- 
lows: 


Execute (highest) 
Read 

Decode 

Fetch 

DMA (lowest) 


OOUOUUU 


Despite the DMA controller’s low priority, you can minimize or even eliminate 
conflicts with the CPU through suitable data structuring because the DMA con- 
troller has its own data and address buses. 


In the ‘C32, the DMA has configurable priorities. Therefore, priorities from 
highest to lowest have been assigned to each of the functioned units of the 
pipeline and to the DMA controller as follows: 


DMA (if configured with highest priority) 
Execute 

Read 

Decode 

Fetch 

DMA (if configured with lowest priority) 


OHOUUUOUU 


A pipeline conflict occurs when an instruction is being processed, and is ready 
to pass to the next higher pipeline level while that level is not ready to accept 
a new input. In this case, the lower priority unit waits until the higher priority unit 
completes executing the current function. 
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8.2 Pipeline Conflicts 


Pipeline conflicts in the ’C3x can be grouped into the following categories: 


Branch conflicts Branch conflicts involve most of those instructions or 
operations that read and/or modify the PC. 


Register conflicts Register conflicts involve delays that can occur when 
reading from, or writing to, registers that are used for 
address generation. 


Memory conflicts Memory conflicts occur when the internal units of the 
’C3x compete for memory resources. 


Each of these three types, including examples, is discussed in the following 
subsections. In these examples, when data is refetched or an operation is 
repeated, the symbol representing the stage of the pipeline is appended with 
a number. For example, if a fetch is performed again, the instruction mnemonic 
is repeated. When an access is detained for multiple cycles because the unit 
is not ready, the symbol RDY indicates that a unit is not ready and RDY indi- 
cates that a unit is ready. If the particular unit does not perform a function, the 
nop label is placed in that stage of the pipeline. 


8.2.1. Branch Conflicts 
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The first class of pipeline conflicts occurs with standard (nondelayed) 
branches, that is, BR, Bcond, DBcond, CALL, IDLE, RPTB, RPTS, RETIcond, 
RETScond, interrupts, and reset. Conflicts arise with these instructions and 
operations because, during their execution, the pipeline is used only for the 
completion of the operation; other information fetched into the pipeline is dis- 
carded or refetched, or the pipeline is inactive. This is referred to as flushing 
the pipeline. Flushing the pipeline is necessary in these cases to ensure that 
portions of succeeding instructions do not inadvertently get partially executed. 
TRAP cond and CALLcond are classified differently from the other types of 
branches and are considered later. 


Example 8—1 shows the code and pipeline operation for a standard branch. 


——[——_ SV 0 n — ama —_—u a a a ——a 
Note: Dummy Fetch 


In this example, one dummy fetch (an MPYF instruction) is performed before 

the branch is decoded. After the branch address is available, a new fetch (an 

OR instruction) is performed. 
a) 
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Example 8-1. Standard Branch 


BR THREE ; Unconditional branch 
MPYF ; Not executed 
ADD ; Not executed 
SUBF ; Not executed 
AND ; Not executed 
THREE OR ; Fetched after BR is taken 
STI 


Pipeline Operation 


PC Fetch | Decode Read | Execute | 
n BR - — = ___|Fetch held for 
new PC value 
n+1 MPYF BR _ 
n+1 (nop) (nop) BR — 
n+1 (nop) (nop) (nop) BR 
3 OR (nop) (nop) (nop) 
STI OR (nop) (nop) 
Note: 


Both RPTS and RPTB flush the pipeline, allowing the RS, RE, and RC registers 
to be loaded at the proper time. If these registers are loaded without the use of 
RPTS or RPTB, no flushing of the pipeline occurs. Thus, RS, RE, and RC can 
be used as general-purpose 32-bit registers without pipeline conflicts. When 
RPTB is nested because of nested interrupts, it may be necessary to load and 
store these registers directly while using the repeat modes. Since up to four 
instructions can be fetched before entering the repeat mode, you should follow 
loads by a branch to flush the pipeline. If the RC is changing when an instruc- 
tion is loading it, the direct load takes priority over the modification made by 


the repeat mode logic. 
ee 


Delayed branches are implemented to ensure the fetching of the next three 
instructions. The delayed branches include BRD, BconaD, and DBcondD. 
Example 8-2 shows the code and pipeline operation for a delayed branch. 
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Example 8-2. Delayed Branch 


BRD THREE ; Unconditional delayed branch 
MPYF ; Executed 

ADD ; Executed 

SUBF ; Executed 

AND ; Not executed 


THREE MPYF ; Fetched after SUBF is fetched 


Pipeline Operation 


PC | Fetch | Decode | Read | Execute 
n BRD = 7 = 
No 
n+1 MPYF BRD _ a execute 
delay 
n+2 ADDF MPYF BRD = 
n+3 SUBF ADDF MPYF BRD 
3 MPYF SUBF ADDF MPYF 


8.2.2 Register Conflicts 
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Register conflicts involve reading or writing registers used for addressing. 
These conflicts occur when the pertinent register is not ready to be used. Some 
conditions under which you can avoid register conflicts are discussed in Sec- 
tion 8.3 on page 8-19. 


The registers comprise the following three functional groups: 


Group 1 This group includes auxiliary registers (ARO-AR7), index 
registers (IRO, IR1), and block-size register (BK). 

Group 2 This group includes the data-page pointer (DP). 

Group 3 This group includes the system-stack pointer (SP). 


If an instruction writes to one of these three groups, the decode unit cannot use 
any register within that particular group until the write is complete, that is, until 
the instruction execution is completed. In Example 8-3, an auxiliary register 
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is loaded, and a different auxiliary register is used on the next instruction. Since 
the decode stage needs the result of the write to the auxiliary register, the 
decode of this second instruction is delayed two cycles. Every time the decode 
is delayed, a refetch of the program word is performed; the ADDF is fetched 
three times. Since these are actual refetches, they can cause not only conflicts 
with the DMA controller but also cache hits and misses. 


A post-/preincrement/decrement of an AR register in an instruction is not 


considered a write to a register. A write is in the form of an LDF, LDI, LDII, or 
DB instruction. 


Example 8-3. Write to an AR Followed by an AR for Address Generation 


LDI 7,AR2 ; 7 7 AR2 

NEXT MPYF *AR2,RO0 ; Decode delayed 2 cycles 
ADDF 
FLOAT 


Pipeline Operation 


PC | Fetch | Decode | Read | Execute | 

n LDI Decode/address 
generation held 
until AR write is 

nel nid completed 

n+2 ADDF 

n+2 ADDF 

n+2 ADDF PYF (nop) (nop) 

n+3 FLOAT ADDF MPYF (nop) 


The case for reads of these groups is similar to the cases for writes. If an 
instruction must read a member of one of these groups, the use of that particular 
group by the decode for the following instruction is delayed until the read is 
complete. The registers are read at the start of the execute cycle and require only 
a one-cycle delay of the following decode. For four registers (IRO, IR1, BK, or DP), 
there is no delay. For all other registers, including the SP, the delay occurs. 


Note that an address generation through the use of an AR register (“ARn, 
*++ARn, *-ARn, etc.) in an instruction is not considered a read. 
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In Example 8-4, two auxiliary registers are added together, with the result going 
to an extended-precision register. The next instruction uses a different auxiliary 
register as an address register. 


Example 8-4. A Read of ARs Followed by ARs for Address Generation 


ADDI ARO,AR1,R1 ; ARO+AR1—>RI1 

NEXT MPYF *++AR2,RO ; Decode delayed one cycle 
ADDF 
FLOAT 


Pipeline Operation 


PC | Fetch | Decode | Read | Execute | 

n ADDI _ = = Decode/address 
generation held 
until AR is read 

n+1 MPYF ADDI - — 

n+2 ADDF MPYF ADDI ~+ - 

n+2 ADDF MYPF (nop) ADDI ARO,AR1,R0 

n+3 FLOAT ADDF MPYF (nop) 

Note: 


Loop counter auxiliary registers for the decrement and branch (DBR) instruc- 
tions are regarded in the same way as they are for addressing. The operation 


shown in Example 8-3 and Example 8-4 also can occur for this instruction. 
a) 


8.2.3. Memory Conflicts 


Memory conflicts can occur when the memory bandwidth of a physical memory 
space is exceeded. For example, RAM blocks 0 and 1 and the ROM block can 
support only two accesses per cycle. The external interface can support only 
one access per cycle. Section 8.4, Memory Access for Maximum Performance, 
on page 8-22 contains some conditions under which you can avoid memory 
conflicts. 
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Memory pipeline conflicts consist of the following four types: 


Program wait A program fetch is prevented from beginning. 

Program fetch Incomplete A program fetch has begun but is not yet 
complete. 

Execute only An instruction sequence requires three CPU 


data accesses in a single cycle. 


Hold everything A primary or expansion bus operation must 
complete before another one can proceed. 


These four types of memory conflicts are illustrated in examples and discussed 
in the paragraphs that follow. 


8.2.3.1 Program Wait 
Two conditions can prevent the program fetch from beginning: 


L) The start of a CPU data access when: 


m Two CPU data accesses are made to an internal RAM or ROM block, 
and a program fetch from the same block is necessary. 


Hm One of the external ports is starting a CPU data access, and a program 
fetch from the same port is necessary. 


1 Amulticycle CPU data access or DMA data access over the external bus 
is needed. 


Example 8-5 illustrates a program wait until a CPU data access completes. 
In this case, “ARO and *AR1 are both pointing to data in RAM block 0, and the 
MPYF instruction will be fetched from RAM block 0. This results in the conflict 
shown in Example 8-5. Because more than two accesses can be made to 
RAM block 0 in a single cycle, the program fetch cannot begin and must wait 
until the CPU data accesses are complete. 
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Example 8-5. Program Wait Until CPU Data Access Completes 
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ADDF3 *ARO,*AR1,RO 
FIX 
MPYF 
ADDF3 
NEGB 


Pipeline Operation 


PC | Fetch | Decode | Read | Execute 

n ADDF3 — — - 

n+1 FIX ADDF3 - = Fetch held 
until data 

n+2 (wait) FIX ADDF3 = access 

pais completes 
n+2 MPYF (nop) FIX ADDF3 
n+ ADDF3 MPYF no FI 
3 (nop) Data 

accessed 

n+4 NEGB ADDF3 MPYF (nop) 


Example 8-6 shows a program wait due to a multicycle data-data access or 
a multicycle DMA access. The ADDF, MPYF, and SUBF are fetched from some 
portion in memory other than the external port the DMA requires. The DMA 
begins a multicycle access. The program fetch corresponding to the CALL is 
made to the same external port that the DMA is using. 


Either of two cases may produce this situation: 


[1 One of the following two memory boundaries is crossed: 


H From internal memory to external memory 
m From one external port to another 


Lj) Code that has been cached is executed, and the instruction prior to the 
ADDF is one of the following (conditional or unconditional): 


m@ A delayed branch instruction 
m A delayed decrement and branch instruction 


Even though the DMA has the lowest priority on ‘C30 and ’C31/VC33 or when con- 
figured as such in the ‘C32, multicycle access cannot be aborted. The program 
fetch must wait until the DMA access completes. 


Example 8-6. Program Wait Due to Multicycle Access 


Pipeline Conflicts 


ADDF ; code 
MPY ; code 
SUBF ; code 
CALL ; code 


in 
in 
in 
in 


internal 
internal 
internal 
external 


memory 
memory 
memory 
memory 


Pipeline Operation 


PC Fetch | Decode | Read | Execute 
n ADDF - = _ 
n+1 MPYF ADDF = _ 
n+2 SUBF MPYF ADDF a 
2-cycle DMA 
n+3 (wait) SUBF MPYF ADDF v access 
n+3 CALL (nop) SUBF MPYF 
n+4 _ CALL (nop) SUBF 


8.2.3.2 Program Fetch Incomplete 


A program fetch incomplete occurs when an instruction fetch takes more than 
one cycle to complete because of wait states. In Example 8-7, the MPYF and 
ADDF are fetched from memory that supports single-cycle accesses. The 
SUBF is fetched from memory requiring one wait state. One example that 
demonstrates this conflict is a fetch across a bank boundary on the primary 
port. See Section 9.5 on page 9-12. 
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Example 8-7. Multicycle Program Memory Fetches 


8.2.3.3 Execute Only 


8-12 


Pipeline Operation 


PC Fetch | Decode | Read | Execute 
n MPYF _ - = 
n+1 ADDF MPYF - = 
n+2 RDY SUBF ADDF MPYF = - 

1 wait state 
n+2 RDY SUBF (nop) ADDF MPYF required 
n+3 ADDI SUBF (nop) ADDF 


Note: PC = program counter 


The execute-only type of memory pipeline conflict occurs when performing an 
interlocked load or when a sequence of instructions requires three CPU data 
accesses in a single cycle. There are two cases in which this occurs: 


1 An instruction performs a store and is followed by an instruction that 
performs two memory reads. 


_j Ahn instruction performs two stores and is followed by an instruction that 
performs at least one memory read. 


() An interlocked load (LDII or LDF1) instruction is performed, and XF1 = 1. 


The first case is shown in Example 8-8. Since this sequence requires three 
data memory accesses and only two are available, only the execute phase of 
the pipeline is allowed to proceed. The dual reads required by the LDF || LDF 
are delayed one cycle. In this case, a refetch of the next instruction can occur. 
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Example 8-8. Single Store Followed by Two Reads 


STFR 0, *AR1 ; RO > *ARI 
LDF *AR2,R1 ; *AR2 —R1 in parallel with 
|| LDF *AR3,R2 ; *AR3 >R2 


Pipeline Operation 


PC | Fetch | Decode | Read | Execute 
n STF = = 
Write must 
n+1 LDF ||LDF STF = complete 
before the 
n+2 Ww LDF | |LDF STF two reads can 
complete 
n+3 x W LDF | |LDF STP [5 pads 
performed 
n+4 x W LDF | |LDF (nop) 
n+4 Y x W LDF ||LDF 


Note: W, X, Y = Instruction representations 
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Example 8-9 shows a parallel store followed by a single load or read. Since 
two parallel stores are required, the next CPU data-memory read must wait 
one cycle before beginning. One program-memory refetch can occur. 


Example 8-9. Parallel Store Followed by Single Read 


8-14 


STF RO, *ARO ; RO — *ARO in parallel with 
ll STF R2,*AR1 ; R2 — *ARL 

ADDF @SUM,R1 ; R1 + @SUM >R1 

IACK 

ASH 


Pipeline Operation 


PC | Fetch | Decode | Read | Execute | 
n stF|lstTF = = = 
Read must wait 
n+1 ADDF stF||stF = = until the writes are 
completed 
n+2 IACK ADDF  STF||STF = 
Writes 
n+3 ASH IACK ADDF / stF||sTF performed 
n+4 ASH IACK ADDF (nop) 
n+4 - ASH IACK ADDF 


The final case involves an interlocked load (LDII or LDFl) instruction and XF1 = 1. 
Since the interlocked loads use the XF1 pin as an acknowledge that the read 
can complete, the loads might need to extend the read cycle, as shown in 
Example 8-10. A program refetch can occur. 
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Example 8-10. Interlocked Load 


NOT R1,RO 
LDII 300h,AR 


ADDI *AR2,R2 
CMPI RO,R2 


Pipeline Operation 


PC | XF1 | Fetch | Decode | Read | Execute 


n 1 NOT — - — 
n+1 1 LDITI NOT - — 
n+2 1 ADDI LDII NOT — 
n+3 1 CMPI ADDI LDII NOT XF1i = 1, 
er read must wait 
n+3 1 - CMPI ADDI LDII 
XF1 = 0, 
n+4 0 - CMPI ADDI LDII read operation 


is complete 


8.2.3.4 Hold Everything 
Three situations result in hold-everything memory pipeline conflicts: 


_) A CPU data load or store cannot be performed because an external port 
is busy. 


_) An external load takes more than one cycle. 


(J Conditional calls and traps, which take one more cycle than conditional 
branches, are processed. 


The first type of hold-everything conflict occurs when one of the external ports 
is busy because an access has started, but is not complete. In Example 8-11, 
the first store is a 2-cycle store. The CPU writes the data to an external port. 
The port control then takes two cycles to complete the data-data write. The 
LDF is a read over the same external port. Since the store is not complete, the 
CPU continues to attempt LDF until the port is available. 
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Example 8-11. Busy External Port 


STF RO, @DMA1 
LDF @DMA2 , RO 


Pipeline Operation 


PC | Fetch | Decode | Read Execute 
n STF _ = = 
n+1 LDF STF = = 
n+2 W LDF STF _ 
n+2 WwW LDF (nop) STF * 
2-cycle external bus 
n+2 WwW LDF (nop) (nop) ¥ write access 
n+3 xX W LDF (nop) 
n+4 Y x W LDF 


Note: W, X, Y = Instruction representations 
The second type of hold-everything conflict involves multicycle data reads. 


The read has begun and continues until completed. In Example 8-12, the LDF 
is performed from an external memory that requires several cycles to access. 
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Pipeline Conflicts 


Example 8-12. Multicycle Data Reads 


LDF @DMA, RO 


Pipeline Operation 


PC | Fetch | Decode | Read Execute 
n LDF = = — 
n+1 I LDF =_ = 
n+2 J i. LDF = 
2-cycle external bus 
n+3 K (dummy) I LDF - z. read access 
n+3 Ky J I LDF 
Note: _ I, J, K = Instruction representations 


The final type of hold-everything conflict deals with conditional calls (CALLcona) 
and traps (TRAPcond), which are different from other branch instructions. 
Whereas other branch instructions are conditional loads, the conditional calls 
and traps are conditional stores, which take one more cycle to complete than 
conditional branches (see Example 8-13). The added cycle pushes the return 
address after the call condition is evaluated. 
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Example 8-13. Conditional Calls and Traps 


8-18 


Pipeline Operation 


PC Fetch | Decode | Read | Execute 

n CALLcond = = a 

n+1 I CALLcond - — 

n+1 (nop) (nop) CALLcond — 

n+1 (nop) (nop) (nop) CALLcond PC store 
“7 cycle 

n+1 (nop) (nop) (nop) CALLcond 

spiel ade I (nop) (nop) (nop) 

Note: | = Instruction representation 
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Resolving Register Conflicts 


If the auxiliary registers (AR7—ARO), the index registers (IR1—IRO), data-page 
pointer (DP), or stack pointer (SP) are accessed for any reason other than 
address generation, pipeline conflicts associated with the next memory access 
can occur. The pipeline conflicts and delays are presented in Section 8.2 on 
page 8-4. 


Example 8-14, Example 8-15, and Example 8-16 demonstrate some common 
uses of these registers that do not produce a conflict or ways that you can avoid 
the conflict. 


Example 8-14. Address Generation Update of an AR Followed by an AR for Address 


Generation 


LDF 7.0,RO0 7; 7.0 — RO 
MPYF *++ARO(IR1),RO 

ADDF *AR2,RO0 

FIX 
MPYF 
ADDF 


n+5 


Note: 


| Fetch 
LDF 
MYPF 
ADDF 
FIX 
MPYF 


ADDF 


Pipeline Operation 


| Decode | Read | Execute 
LDF - - 
MYPF LDF _ 
ADDF MvPF | LDF 
FIX ADDF | MYPF 
MYPF FIX ADDF 


W, X, Y, Z = Instruction representations 
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Example 8-15. Write to an AR Followed by an AR for Address Generation Without a 
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Pipeline Conflict 

LDI @TABLE , AR2 

MPYF @VALUE,R1 

ADDF R2,R1 

MPYF *AR2++,R1 

SUBF 

STF 

Pipeline Operation 

PC | Fetch | Decode | Read | Execute 
n LDI = = = 
n+1 MYPF LDI = = 
n+2 ADDF MYPF LDI = 
n+3 MYPF ADDF MYPF pra AR? written] 
n+4 SUBF MYPF ADDF MYPF 
n+5 STF SUBF MYPF ADDF 


AR2 read 


Resolving Register Conflicts 


Example 8-16. Write to DP Followed by a Direct Memory Read Without a Pipeline Conflict 


LDP TABLE ADDR 

POP RO 

LDF *-AR3(2),R1 

LDI @TABLE ADDR, ARO 
PUSHF R6 

PUSH R4 


Pipeline Operation 


PC Fetch | Decode | Read | Execute 

n LDP - = = 

n+1 POP LDP - _ 

n+2 LDF POP LDP — 
n+3 LDI LDF POP LDP 

n+4 PUSHF LDI LDF POP 

n+5 PUSH PUSHF LDI LDF 
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8.4 Memory Access for Maximum Performance 


If program fetches and data accesses are performed so that the resources 
being used cannot provide the necessary bandwidth, the pipeline is stalled 
until the data accesses are complete. Certain configurations of program fetch 
and data accesses yield conditions under which the ’C3x can achieve 
maximum throughput. 


Table 8-1 shows how many accesses can be performed from the different 
memory spaces when it is necessary to do a program fetch and a single data 
access and still achieve maximum performance (one cycle). Four cases 
achieve 1-cycle maximization. 


Table 8-1. One Program Fetch and One Data Access for Maximum Performance 


Primary Expansion Bust 
Bus Accesses From Dual or 
Case No. Accesses_ Access Internal Memory Peripheral Accesses 


1 1 1 = 
2 1 = 1 


2 from any combination of 
internal memory 


4 — 1 1 

t The expansion bus is available only on the ‘C30. 

Table 8-2 shows how many accesses can be performed from the different 
memory spaces when it is necessary to do a program fetch and two data 


accesses and still achieve maximum performance (one cycle). Six conditions 
achieve this maximization. 
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Table 8-2. One Program Fetch and Two Data Accesses for Maximum Performance 


Case No. 
1 


ol 


oOo jo N OD 


Primary Bus 
Accesses 


1 
1 program 
1 data 


1 data 


1 program 


1 DMA 


Accesses From Dual-Access 
Internal Memory 


2 from any combination of internal memory 
1 data 

1 data 

1 program, 1 data 


2 from same internal memory block and 1 from 
a different internal memory block 


3 from different internal memory blocks 
2 from any combination of internal memory 
2 data 


2 data 


t The expansion bus is available only on the ’C30. 


Expansiont Or 
Peripheral Bus 
Accesses 


1 data 
1 program 


1 DMA 


{ 
1 DMA 


1 program 
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8.5 Clocking Memory Accesses 


This section discusses the role of internal clock phases (H1 and H3) and how 
the ’C3x handles multiple-memory accesses. The previous section discusses 
the interaction between sequences of instructions; this section discusses the 
flow of data on an individual instruction basis. 


Each major clock period of 33.3 ns is composed of two minor clock periods of 
16.67 ns, labeled H3 and H1. The active clock period for H3 and H1 is the time 
when that signal is high. See Figure 8-2. 


Figure 8-2. Minor Clock Periods 


‘¢—_—_ Major clock period —> 


H1 
minor 


H1 clock period 
H3 
H3 minor 
clock period 


The precise operation of memory reads and writes can be defined according 
to these minor clock periods. The types of memory operations that can occur 
are program fetches, data loads and stores, and DMA accesses. 


8.5.1 Program Fetches 


Internal program fetches are always performed during H3 unless a single data 
store must occur at the same time due to another instruction in the pipeline. In that 
case, the program fetch occurs during H1 and the data store occurs during HS. 


External program fetches always start at the beginning of H3 with the address 
being presented on the external bus. At the end of H1, the fetches are completed 
with the latching of the instruction word. 


8.5.2 Data Loads and Stores 


Four types of instructions perform loads, memory reads, and stores: 


2-operand instructions 

3-operand instructions 

Multiplier/ALU operation with store instructions 
Parallel multiply and add instructions 


Lt Lo 
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See Chapter 6, Addressing Modes, for more information. 


As discussed in Chapter 7, the number of bus cycles for external memory 
accesses differs in some cases from the number of CPU execution cycles. For 
external reads, the number of bus cycles and CPU execution cycles is identical. 
For external writes, there are always at least two bus cycles, but unless there 
is aport-access conflict, there is only one CPU execution cycle. In the following 
examples, any difference in the number of bus cycles and CPU cycles is noted. 


8.5.2.1 2-Operand Instruction Memory Accesses 


All instructions whose bits 31-29 are 000 or 010 (see Figure 8-3) are 2-operand 
instructions. In the case of a data read, bits 15-0 represent the src operand. 
Internal data reads are always performed during H1. External data reads always 
start at the beginning of H3 with the address presented on the external bus; they 
complete with the latching of the data word at the end of H1. 


In the case of a data store, bits 15-0 represent the dst operand. Internal data 
stores are performed during H3. External data stores always start at the 
beginning of H3 with the address and data being presented on the external bus. 


Figure 8-3. 2-Operand Instruction Word 
3 24 23 16 15 8 7 0 


| 
wc 


8.5.2.2 3-Operand Instruction Memory Reads 


All instructions whose bits 31-29 are 001 (see Figure 8-4) are 3-operand 
instructions. The source operands, src? and src2, come from either registers 
or memory. When one or more of the source operands are from memory, these 
instructions are always memory reads. 


Figure 8-4. 3-Operand Instruction Word 
31 24 23 16 15 


Of 0 


If only one of the source operands is from memory (either src? or src2) and is 
located in internal memory, the data is read during H1. If the single memory 
source operand is in external memory, the read starts at the beginning of H3, 
with the address presented on the external bus, and completes with the latching 
of the data word at the end of H1. 
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If both source operands are to be fetched from memory, then memory reads 
can occur in several ways: 


(1 If both operands are located in internal memory, the src7 read is performed 
during H3 and the src2 read during H1, completing two memory reads in 
a single cycle. 


1 If src7is in internal memory and src2is in external memory, the src2 access 
begins at the start of H3 and latches at the end of H1. At the same time, the 
src? access to internal memory is performed during H3. Again, two memory 
reads are completed in a single cycle. 


(1 If src7 is in external memory and src2 is in internal memory, two cycles are 
necessary to complete the two reads. In the first cycle, both operands are 
addressed. Since src7 takes an entire cycle to be read and latched from 
external memory, the internal operation on src2 cannot be completed until 
the second cycle. Ordering the operands so that src7 is located internally 
is necessary to achieve single-cycle execution. 


(1 ‘If src7 and src2 are both from external memory, two cycles are required to 
complete the two reads. In the first cycle, the src1 access is performed and 
loaded on the next H3; in the second cycle, the src2 access is performed 
and loaded on that cycle’s H1. 


If src2 is in external memory and src7 is in on-chip or external memory and is 
immediately preceded by a single store instruction to external memory, a 
dummy src2 read can occur between the execution of the store instruction and 
the src2 read, regardless of which memory space is accessed (STRB, MSTRB, 
or IOSTRB). The dummy read can cause an externally interfaced FIFO address 
pointer to be incremented prematurely, thereby causing the loss of FIFO data. 
Example 8-17 illustrates how the dummy read can occur. Example 8-18 offers 
an alternative code segment that suppresses the dummy read. In the alternative 
code segment, the dummy read is eliminated by swapping the order of the 
source operands. 
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Example 8-17. Dummy sr2 Read 


STI RO, *AR6 ; AR6 points to MSTRB space 
ADDI3 *AR1,*AR3,RO ; AR3 points to on-chip RAM (src1l) 
; AR1 points to MSTRB space (src2) 
H1 | Lf LJ L__J | 
H3 ~~ | [| TCO Td | 
Pipeline Operation 
PC Fetch | Decode | Read | Execute 
n STI 
n+1 ADDI3 STI 
n+2 ADDI3 STI —_ 

RO, *AR6 until the 
nes - oe store is complete 
n+4 - -— — 
n+5 ADDI3 = 2-cycle dummy 

load of src2 
n+6 - | - 
n+7 Te x = actual read of 
n+8 BDDI3 srce2 and src1 


Two cycles are required for the MSTRB store. Two additional cycles are required 
for the dummy MSTRB read of *AR3 (because a read follows a write). One cycle 
is required for an actual MSTRB read of *AR3. 
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Example 8-18. Operand Swapping Alternative 


n+7 


n+8 
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Switch the operands of the 3-operand instruction so that the internal read is 
performed first. 


Sri RO, *AR6 ; AR6 points to MSTRB space 
ADDI3 *AR3,*AR1,RO ; AR3 points to on-chip RAM (src2) 
; AR1 points to MSTRB space (src1) 
H1 __ id L | Lf LJ | 
Pipeline Operation 
Fetch | Decode | Read | Execute 
STI 
ADDI3 STI 
ADDI3 STI 


— sti * 
2-cycle store 


= -¥ The read of src2 cannot start 
until the store is complete 
2-cycle read of src1 and src2 
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8.5.2.3. Operations with Parallel Stores 


The next class of instructions includes every instruction that has a store in parallel 
with another instruction. Bits 31 and 30 for these instructions are equal to 1 1. 


The instruction word format for operations that perform a multiply or ALU opera- 
tion in parallel with a store is shown in Figure 8-5. If the store operation to dst2 
is external or internal, it is performed during H3. Two bus cycles are required for 
external stores, but only one CPU cycle is necessary to complete the write. 


If the memory read operation is external, it starts at the beginning of H3 and 
latches at the end of H1. If the memory read operation is internal, it is 
performed during H1. Note that memory reads are performed by the CPU 
during the read (R) phase of the pipeline, and stores are performed during the 
execute (E) phase. 


Figure 8-5. Multiply or CPU Operation With a Parallel Store 
3 24 23 16 15 8 7 0 


| 


The instruction word format for instructions that have parallel stores to memory 
is shown in Figure 8-6. If both destination operands, dst? and dst2, are located 
in internal memory, dst7 is stored during H3 and d'st2 during H1, thus completing 
two memory stores in a single cycle. 


Figure 8-6. Two Parallel Stores 
31 24 23 16 15 


8 7 0 
ST||ST src2 000} srci dst 


If dst7 is in external memory and dsi2 is in internal memory, the dst7 store 
begins at the start of H3. The dst2 store to internal memory is performed 
during H1. Two bus cycles are required for the external store, but only one 
CPU cycle is necessary to complete the write. Again, two memory stores 
are completed in a single cycle. 


Oo 


If dst7 is in internal memory and dsi2 is in external memory, an additional bus 
cycle is necessary to complete the dsi2 store. Only one CPU cycle is neces- 
sary to complete the write, but the port access requires three bus cycles. In 
the first cycle, the internal dst7 store is performed during H3, and d'si2 is writ- 
ten to the port during H1. During the next cycle, the dsi2 store is performed 
on the external bus, beginning in H3, and executes as normal through the 
following cycle. 
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If dst? and dst2 are both written to external memory, a single CPU cycle 
is still all that is necessary to complete the stores. In this case, four bus 
cycles are required. 


1) Inthe first cycle, both dst7 and dst2 are written to the port, and the ex- 
ternal-bus access for dst? begins. 


a) The store for dst? is completed on the second cycle. 
b) The store for dst2 begins on the third external-bus cycle. 


c) The store for dst2 is completed on the fourth external-bus cycle. 


8.5.2.4 Parallel Multiplies and Adds 


Memory addressing for parallel multiplies and adds is similar to that for 3-operand 
instructions. The parallel multiplies and adds include all instructions with bits 
31-30 = 10 (see Figure 8-7). 


Figure 8-7. Parallel Multiplies and Adds 


8-30 


3 


| 
o| overt 


24 23 16 15 8 7 0 
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For these operations, src3 and src4 are both located in memory. If both operands 
are located in internal memory, src3 is performed during H3, and src4 is per- 
formed during H1, thus completing two memory reads in a single cycle. 


L 


If src3 is in internal memory and src4 is in external memory, the src4 access 
begins at the start of H3 and latches at the end of H1. At the same time, the 
src3 access to internal memory is performed during H3. Again, two memory 
reads are completed in a single cycle. 


If src3 is in external memory and src4 is in internal memory, two cycles 
are necessary to complete the two reads. In the first cycle, the internal src4 
access is performed. During the H3 of the next cycle, the src3 access is 
performed. 


If src3 and src4 are both from external memory, two cycles are necessary 
to complete the two reads. In the first cycle, the src3 access is performed; 
in the second cycle, the src4 access is performed. 
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TMS320C30 and TMS320C31/VC33 
External-Memory Interface 


This chapter describes the ‘C30 and ’C31/VC33 external-memory interface. 
See Chapter 10, Enhanced External-Memory Interface, for detailed informa- 
tion on the ’C32 external bus operation. 


Memories and external peripheral devices are accessible through two external 
interfaces on the ’C30: 


_j Primary bus 
_j) Expansion bus 


On the ’C31/VC33, one bus, the primary bus, is available to access external 
memories and peripheral devices. You can control wait-state generation, permit- 
ting access to slower memories and peripherals, by manipulating memory- 
mapped control registers associated with the interfaces and by using an external 
input signal. 
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Note: VC33 


The TMS320VC33 is logically and functionally a superset of the TMS320C31 
making nearly all literature and documentation for the TMS320C31 applica— 
ble. However, the electrical characteristics are different and are covered in 
detail in the TMS320VC33 data sheet (literature number SPRS087). 
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Overview 


9.1 Overview 


9-2 


The ’C30 provides two external interfaces: the primary bus and the expansion 
bus. The TMS320C31/VC33 provides one external interface: the primary bus. 
The primary bus consists of a 32-bit data bus, a 24-bit address bus, and a set 
of control signals. The expansion bus consists of a 32-bit data bus, a 13-bit 
address bus, and a set of control signals. Each interface has the following 
features: 


1 Separate configurations controlled by memory-mapped external interface 
control registers 


[1 Hold request and acknowledge signal for putting the external memory inter- 
face signals in high impedance mode and preventing the processor from 
accessing the external bus 


1 Selectable wait state that can be controlled through software, hardware, 
or combination of software and hardware 


(4 Unified memory space for data, program, and I/O access 
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9.2 Memory Interface Signals 


This section describes the differences between the ‘C30 and ’C31/VC33 
memory interface signals. 


9.2.1 TMS320C30 Memory Interface Signals 


The TMS320C30 has two sets of control signals as follows: 


<4) Primary bus control signals: STRB, R/W, HOLD, HOLDA, RDY 


Table 9-1 lists and describes the signals. 


_} Expansion bus control signals: MSTRB, IOSTRB, XR/W, XRDY 


Table 9-2 lists and describes the expansion bus control signals. 


Access is determined by an active strobe signal (STRB, MSTRB, or IOSTRB). 
When a primary bus access is performed, STRB is low. The expansion bus of 
the ’C30 supports two types of accesses: 


(4 Memory access signaled by MSTRB low. The timing for an MSTRB access 
is the same as that of the STRB access on the primary bus. 


_j} External peripheral device access is signaled by IOSTRB low. 


Each of the buses (primary and expansion) has an associated control register. 
These registers are memory-mapped as shown in Figure 9-1. 


9.2.2 TMS320C31/VC33 Memory Interface Signals 
The TMS320C31/VC33 has one set of control signals: 
Primary bus control signals: STRB, R/W, HOLD, HOLDA, RDY 


STRB is low when an external bus access is performed. The primary bus 
control register controls its behavior (see Section 9.3). 
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Table 9-1. Primary Bus Interface Signals 


ca) fa 


Signal Typet 


STRB oz 
R/W O/Z 
HOLD l 


HOLDA O/Z 
RDY | 


A(23-0) O/Z 


D(31-0) WVO/Z 


TI Input 
O Output 
Z_ High impedance 


+NA means not affected. 
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Description 
Primary interface access strobe 


Specifies memory read (active high) or write 
(active low) mode 


Hold external memory interface 
Hold acknowledge for external memory interface 


Indicates external primary interface is ready to 
be accessed 


Primary address bus. When the primary bus 
address lines are not in high-impedance state 
due to HOLD signal, they keep in the last exter- 
nal primary bus access. 


Primary data bus. These signals go to high- 
impedance between write accesses. 


Value 
After Reset 


HIZ 


Idle Status 
1 


{ 


Ignored 
1 


Ignored 


Address of last external 
bus access 


HIZ 


Table 9-2. Expansion Bus Interface Signals 


Memory Interface Signals 


Signal 
MSTRB 
IOSTRB 


XR/W 


XRDY 


XA (12-0) 


XD (31-0) 


TI Input 
O Output 


Typet 
O/Z 
O/Z 


V/O/Z 


Z_ High impedance 


+NA means not affected. 


Description 
Expansion bus memory access strobe 
Expansion bus peripheral-access strobe 


Specifies memory (active high) or write (active 
low) mode 


Indicates external expansion interface is ready 
to be accessed 


Expansion address bus. When the expansion 
bus address lines are not in high-impedance 
state due to HOLD signal, they keep the last 
external expansion bus access. 


Expansion data bus. These signals go to high- 
impedance between write accesses. 


Value 
After Reset 


{ 
{ 
{ 


HIZ 


Idle Status 
1 


{ 
1 


Ignored 


Address of last external 
expansion bus access 


HIZ 
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Figure 9-1. Memory-Mapped External Interface Control Registers 


Peripheral 
Address 


808060h 
g0806th 
808062h 
808063h 
808064h 
808065h 
808066h 
808067h 
808068h 
808069h 
80806Ah 
80806Bh 
80806Ch 
80806Dh 
80806Fh 


Memory Interface Control Registers 


9.3. Memory Interface Control Registers 


Two memory interface control registers, the primary-bus control register and 
the expansion-bus control register, are described in this section. 


9.3.1. Primary-Bus Control Register 


The primary bus control register is a 32-bit register that contains the control bits 
for the primary bus (see Figure 9-2). Table 9-3 describes the register bits with 
the bit names and functions. 


Figure 9-2. Primary-Bus Control Register ‘att 


31-161 1513 12+ 11 10 9 8:17 #6 5 


' 
4. 3 2 1 0 
XX | XX : BNKCMP : WTCNT NOHOLD | HOLDST 
Riw R/W R/W R 


' ' R/W ' R/W 


Notes: 1) xx =reserved bit, read as 0 
2) R=read, W =write 


Note: 


After changing the bit fields of the primary-bus control register, up to three 
instructions are fetched before the primary bus is reconfigured because the 
configuration change is performed in the execute stage of the pipeline. 
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Table 9-3. Primary-Bus Control Register Bits 


Abbreviation Reset Value Name Description 


HOLDST 0 Hold status bit This bit signals whether the port is being 
held (HOLDST = 1) or is not being held 
(HOLDST = 0). This status bit is valid 
whether the port has been held through 
hardware or software. 


NOHOLD 0 Port hold signal NOHOLD allows or disallows the port to be 
held by an external HOLD signal. When 
NOHOLD = 1, the ’C3x takes over the 
external bus and controls it, regardless of 
serviced or pending requests by external 
devices. No hold acknowledge (HOLDA) is 
asserted when a HOLD signal is received. It 
is asserted if an internal hold is generated 
(HIZ = 1). 


HIZ 0 Internal hold When set (HIZ = 1), the port is put in hold 
mode. This is equivalent to the external 
HOLD signal. By forcing a high-impedance 
condition, the ’C3x can relinquish the exter- 
nal-memory port through software. HOLDA 
goes low when the port is placed in the high- 
impedance state. 


SWW 11 Software wait mode In conjunction with WTCNT, this 2-bit field 
defines the mode of wait-state generation. 
(See Table 9-5.) 


WTCNT 111 Software wait mode This three-bit field specifies the number of 
cycles to use when in software wait mode for 
the generation of internal wait states. The 
range is 0 (WTCNT = 00 0) to 7 (WTCNT=1 
1 1) H1/H3 cycles. (See Section 9.4.) 


BNKCMP 10000 Bank compare This 5-bit field specifies the number of MSBs 
of the address to be used to define the bank 
size. (See Table 9-6.) 
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9.3.2 Expansion-Bus Control Register 


The expansion-bus control register is a 32-bit register that contains control bits 
for the expansion bus (see Figure 9-3 and Table 9-4). 


Figure 9-3. Expansion-Bus Control Register 


Notes: 1) xx =reserved bit, read as 0 
2) R=read, W =write 


Table 9-4. Expansion-Bus Control Register Bits 


Abbreviation Reset Value Name Description 
SWW 11 Software wait mode In conjunction with the WTCNT, 2-bit field 


defines the mode of wait-state generation. 
(See Table 9-5.) 


WTCNT 111 Software wait mode This 3-bit field specifies the number of cycles 
to use when in software wait mode for the 
generation of internal wait state. The range is 
0 (WTCNT = 000) to 7 (WTCNT = 1) H1/H3 
cycles. (See Section 9.4.) 


Te | 


Note: 


After changing the bit fields of the expansion-bus control register, up to three 
instructions are fetched before the expansion bus is reconfigured because 
the configuration change is performed in the execute stage of the pipeline. 
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9.4 Programmable Wait States 


The ’C8x has its own internal software-configurable ready-generation capability 
for each strobe. This software wait-state generator is controlled by configuring 
two bit fields in the primary or expansion bus interface control registers. 


Use the WTCNT field to specify the number of software wait-states to generate 
and use the SWW field to select one of the following four modes of wait-state 
generation: 


Lj External RDY wait states are generated solely by the external RDY line 
ignoring software wait states. 


LJ WTCNT-generated RDYwicnt wait states are generated solely by the soft- 
ware wait-state generator ignoring external RDY signals. 


LJ Logical-AND of RDY and RDYwicnt wait states are generated with a logical 
AND of internal and external ready signals. Both signals must occur. 


LJ Logical-OR of RDY and RDYwicn;t wait states are generated with a logical 
OR of internal and external ready signals. Either signal can generate the 
ready signal. 


The four modes are used to generate the internal ready signal, RDYjn}, that 
controls accesses. As long as RDYj,; = 1, the current external access is 
delayed. When RDYjn; = 0, the current access completes. Since the use of 
programmable wait states for both external interfaces is identical, only the 
primary bus interface is described in the following paragraphs. 


RDY tent is an internally-generated ready signal. When an external access is 
begun, the value in WTCNT is loaded into a counter. WTCNT can be any value 
from 0 through 7. The counter is decremented every H1/H3 clock cycle until 
it becomes 0. Once the counter is set to 0, it remains set to 0 until the next 
access. While the counter is nonzero, RDYwtent = 1. While the counter is 0, 
RDYwtent = 0- 


Table 9-5 shows the truth table for each value of SWW and the different 
combinations of RDY, RDYwtent, and RDYjnt. 


Se 
Note: 


At reset, the ’C3x is programmed with seven wait states for each external 
memory access. These wait states are inserted to ensure the system can 
function with slow memories. To maximize system performance when acces- 
sing external memories, you need to decrease the number of wait states. 


After changing the wait states, up to three instructions are fetched before the 


change in the wait state occurs. 
Cn! 


Programmable Wait States 


Table 9-5. Wait-State Generation 


Inputs Output 
SWW Bit Field /RDYext /RDYwtcnt /RDYint Functional Description 
00 0 x 0 Wait until external RDY is signaled 
1 X 1 
01 X 0 0 Wait until internal wait state generator 
counts down to 0 
X 1 i 
10 0 0 0 Wait until first signal: external RDY or the 
0 1 0 internal wait state generator (logical OR) 
1 0 0 
1 1 1 
11 0 0 0 Wait until both external RDY is signaled 
0 1 { and wait state generator counts down to 
0 (logical AND) 
1 0 1 
1 1 | 
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9.5 Programmable Bank Switching 


Programmable bank switching allows you to switch between external memory 
banks without having to insert wait states externally due to memories that require 
several cycles to turn off. Bank switching is implemented on the primary bus only. 


The size of a bank is determined by the number of bits specified by the BNKCMP 
field of the primary bus control register. For example, if BNKCMP = 16, the 16 
MSBs of the address are used to define a bank (see Figure 9-4). Since 
addresses are 24 bits, the bank size is specified by the eight LSBs, yielding a 
bank size of 256 words. If BNKCMP = 16, only the 16 MSBs are compared. Bank 
sizes from 28 = 256 to 224 = 16M are allowed. Table 9-6 summarizes the relation- 
ship between BNKCMP, the address bits used to define a bank, and the resulting 
bank size. 


Figure 9-4. BNKCMP Example 


le 24-bit address >| 
23 8 |7 0 


Eas as 


}¢————— Number of bits to compare ——_— Defines bank size —| 


Table 9-6. BNKCMP and Bank Size 


BNKCMP MSBs Defining a Bank Bank Size (32-Bit Words) 
00000 None 224 = 16M 
00001 23 223 = 8M 
00010 23-22 222 - 4M 
00011 23-21 221 = 2M 
00100 23-20 220=1M 
00101 23-19 219 = 512K 
00110 23-18 218 = 256K 
00111 23-17 217 = 128K 
01000 23-16 216 — 64K 
01001 23-15 215 = 32K 
01010 23-14 214 = 16K 
01011 23-13 213 = 8K 
01100 23-12 212 — 4K 
01101 23-11 211 = 2K 
01110 23-10 210 =1K 
01111 23-9 29 =512 
10000 23-8 28 = 256 
10001-11111 Reserved Undefined 


Programmable Bank Switching 


The ’C3x has an internal register that contains the MSBs (as defined by the 
BNKCMP field) of the last address used for a read or write over the primary inter- 
face. At reset, the register bits are set to 0. If the MSBs of the address being used 
for the current primary interface read do not match those contained in this 
internal register, a read cycle is not asserted for one H1/H3 clock cycle. During 
this extra clock cycle, the address bus switches over to the new address, but 
STRB is inactive (high). The contents of the internal register are replaced with 
the MSBs being used for the current read of the current address. If the MSBs 
of the address being used for the current read match the bits in the register, a 
normal read cycle takes place. 


If repeated reads are performed from the same memory bank, no extra cycles 
are inserted. When a read is performed from a different memory bank, an extra 
cycle is inserted. This feature can be disabled by setting BNKCMP to 0. The 
insertion of the extra cycle occurs only when a read is performed. The 
changing of the MSBs in the internal register occurs for all reads and writes 
over the primary interface. 


Figure 9-5 shows the addition of an inactive cycle when switches between 
banks of memory occur. 
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Figure 9-5. Bank-Switching Example 


' Extra ’ 
cycle 


Note: 


After changing BNKCMP, up to three instructions are fetched before the 
change in the bank size occurs. 
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9.6 External Memory Interface Timing 


9.6.1 


This section discusses functional timing of operations on the primary bus and the 
expansion bus, the two independent parallel buses or the ‘C3x devices. 


The parallel buses implement three mutually exclusive address spaces distin- 
guished through the use of three separate control signals: STRB, MSTRB, and 
IOSTRB. The STRB signal controls accesses on the primary bus, and the 
MSTRB and IOSTRB signals control accesses on the expansion bus. Since 
the two buses are independent, you can make two accesses in parallel. 


With the exception of bank switching and the external HOLD function (discussed 
later in this section), timing of primary bus cycles and MSTRB expansion bus 
cycles are identical and are discussed collectively. The abbreviation (M)STRB is 
used in references that pertain equally to STRB and MSTRB. Similarly, (X)R/W, 
(X)A, (X)D, and (X)RDY are used to symbolize the equivalent primary and expan- 
sion bus signals. The IOSTRB expansion bus cycles are timed differently and are 


discussed independently. 


Primary-Bus Cycles 


All bus cycles comprise integral numbers of H1 clock cycles. One H1 cycle is 
defined to be from one falling edge of H1 to the next falling edge of H1. For 
full-speed (zero wait-state) accesses, writes require two H1 cycles and reads 
require one cycle; however, if the read follows a write, the read requires two 
cycles. This applies to both the primary bus and the MSTRB expansion bus 
access. 


EOE ————————————————— a eh 


Note: Posted Write 


The data written to external memory by CPU or DMA is “latched” into the bus 
logic, allowing the CPU to continue with internal operation. Consequently, 
writes to external memory effectively require only one cycle if no accesses to 
that interface are in progress. However, if the next DMA or CPU access is to 
the same external bus, the DMA or CPU waits and the write is considered a 


2-cycle operation. This is normally referred to as posted-write. 
ee | 


The following discussions pertain to zero wait-state accesses unless otherwise 
specified. 
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The (M)STRB signal is low for the active portion of both reads and writes. The 
active portion lasts one H1 cycle. Additionally, before and after the active portion 
((M)STRB low) of writes only, there is a transition cycle of H1. This transition 
cycle consists of the following sequence: 


1) (M)STRB is high. 
2) If required, (X)R/W changes state on H1 rising. 


3) If required, address changes on H1 rising if the previous H1 cycle was the 
active portion of a write. If the previous H1 cycle was a read, address 
changes on the next H1 falling. 


Figure 9-6 illustrates a read-read-write sequence for (M)STRB active and no 
wait states. The data is read as late in the cycle as possible to allow maximum 
access time from address valid. Although external writes require two cycles, 
internally (from the perspective of the CPU and DMA) they require only one 
cycle if no accesses to that interface are in progress. In the typical timing for 
all external interfaces, the (X)R/W strobe does not change until (M)STRB or 
IOSTRB goes inactive. 


External Memory Interface Timing 


Figure 9-6. Read-Read-Write for (M)STRB = 0 


Note: (x) RDY is sampled low on rising edge of H1. Data is read next falling edge of H1. 


Note: Back-to-Back Read Operations 


(M)STRB remains low during back-to-back read operations. 
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Figure 9-7 illustrates a write-write-read sequence for (M)STRB active and no 
wait states. The address and data written are held valid approximately one-half 


cycle after (M)STRB changes. 


Figure 9-7. Write-Write-Read for (M)STRB = 0 


|e) ==} Cae) =< 
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Figure 9-8 illustrates a read cycle with one wait state. Since (X)RDY = 1, the 
read cycle is extended. (M)STRB, (X)R/W, and (X)A are also extended one 
cycle. The next time (X)RDY is sampled, it is 0. 


Figure 9-8. Use of Wait States for Read for (M)STRB = 0 
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Figure 9-9 illustrates a write cycle with one wait state. Since initially (X)RDY = 1, 
the write cycle is extended. (M)STRB, (X)R/W, and (X)A are extended one cycle. 
The next time (X)RDY is sampled, it is 0. 


Figure 9-9. Use of Wait States for Write for (M)STRB = 0 


= Write data | 
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9.6.2 Expansion-Bus I/O Cycles 


In contrast to primary bus and MSTRB cycles, IOSTRB reads and writes are 
both two cycles in duration (with no wait states) and exhibit the same timing. 
During these cycles, address always changes on the falling edge of H1, and 
IOSTRB is low from the rising edge of the first H1 cycle to the rising edge of 
the second H1 cycle. The IOSTRB signal always goes inactive (high) between 
cycles, and XR/W is high for reads and low for writes. 


Figure 9-10 illustrates read and write cycles when IOSTRB is active and there 
are no wait states. For IOSTRB accesses, reads and writes require a minimum 
of two cycles. Some off-chip peripherals might change their status bits when 
read or written to. Therefore, it is important to maintain valid addresses when 
communicating with these peripherals. For reads and writes when IOSTRB is 
active, IOSTRB is completely framed by the address. 


Figure 9-10. Read and Write for |OSTRB = 0 


IOSTRB 
XR/W 


XD Write data 


XRDY 
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Figure 9-11 illustrates a read with one wait state when IOSTRB is active, and 
Figure 9-12 illustrates a write with one wait state when IOSTRB is active. For 
each wait state added, IOSTRB, XR/W, and XA are extended one clock cycle. 
Writes hold the data on the bus one additional cycle. The sampling of XRDY 
is repeated each cycle. 


Figure 9-11.Read With One Wait State for IOSTRB = 0 


IOSTRB 
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Figure 9-12. Write With One Wait State for |OSTRB = 0 


IOSTRB 


XR/W 


XA 


XD 


XRDY 
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Figure 9-13 through Figure 9-23 illustrate the various transitions between 
memory reads and writes, and I/O writes over the expansion bus. 


Figure 9-13. Memory Read and I/O Write for Expansion Bus 


MSTRB 
IOSTRB 
XR/W 


\/O address 


XRDY 
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Figure 9-14. Memory Read and I/O Read for Expansion Bus 
H3 

H1 
MSTRB 
IOSTRB 
XR/W 
XA 


XD I/O read 


XRDY 
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Figure 9-15. Memory Write and I/O Write for Expansion Bus 


H1 


MSTRB 


IOSTRB 


XR/W 


XA 


XD 


XRDY 
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Figure 9-16. Memory Write and I/O Read for Expansion Bus 


H1 
MSTRB 
IOSTRB 

XR/W 


XA Memory address \/O address 
VO read 


XD 


XRDY 
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Figure 9-17. I/O Write and Memory Write for Expansion Bus 


H1 


MSTRB 

IOSTRB 

XR/W 
XA \/O address Memory address 

10 
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Figure 9-18. I/O Write and Memory Read for Expansion Bus 


MSTRB 


iOSTRB | 
XR/W 
XD < vowrte SY write a , 3 


XRDY 
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Figure 9-19. I/O Read and Memory Write for Expansion Bus 


MSTRB \ ' 

' ' ' \ ' ' ; ; 

: | | | | ! , ! 

lIOSTRB \ | ' ' 

a Le : : : 
XA \/O address Memory address 


TROY /—— * : 
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Figure 9-20. I/O Read and Memory Read for Expansion Bus 


H3 


! ! 
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Figure 9-21. I/O Write and I/O Read for Expansion Bus 
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Figure 9-22. I/O Write and I/O Write for Expansion Bus 
H3 

H1 
MSTRB 
IOSTRB 
XR/W 
Xn 

XD 


XRDY 
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Figure 9-23. I/O Read and I/O Read for Expansion Bus 
H3 

H1 
MSTRB 
iOSTRB 


XR/W 


XA 


XD 


XRDY 


9-34 


External Memory Interface Timing 


Figure 9-24 and Figure 9-25 illustrate the signal states when a bus is inactive 
(after an IOSTRB or (M)STRB access, respectively). The strobes (STRB, 
MSTRB and IOSTRB) and (X)R/W) go to 1. The address is driven with last exter- 
nal bus access, and the ready signal (XRDY or RDY) is ignored. 


Figure 9-24. Inactive Bus States for |OSTRB 


H1 
iOSTRB 
XR/W = 
a\y 

a 


XRDY XRDY ignored 


Bus inactive >| 


a- 
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Figure 9-25. Inactive Bus States for STRB and MSTRB 


4 Write data 


(X)RDY ignored 
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9.6.3 Hold Cycles 


Figure 9-26 illustrates the timing for HOLD and HOLDA. HOLD is an external 
asynchronous input. There is a minimum of one cycle delay from the time when 
the processor recognizes HOLD = 0 until HOLDA = 0. When HOLDA = 0, the 
address, data buses, and associated strobes are placed in a high-impedance 
state. All accesses occurring over an interface are completed before a hold is 
acknowledged. 


Figure 9-26. HOLD and HOLDA Timing 


Write data 


inactive 
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Chapter 10 


TMS320C32 Enhanced External Memory 
Interface 


The ’C32 external memory interface provides greater flexibility by improving 
the ’C3x core with several new features. This chapter describes these features 
and enhancements in detail. 
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10.1 TMS320C32 Memory Features 


10-2 


The ’C32 external memory interface includes the following features: 


L 


L 


One external pin, PRGW, configures the external-program-memory width 
to 16 or 32 bits. 


Two sets of memory strobes (STRBO and STRB1) and one IOSTRB allow 
zero glue-logic interface to two banks of memory and one bank of external 
peripherals. 


Separate bus control registers for each strobe-control wait-state genera- 
tion, external memory width, and data-type size. 


Each memory STRB handles 8-, 16- or 32-bit external data accesses 
(reads and writes) to 8-, 16-, or 32-bit-wide memory. 


Multiprocessor support through the HOLD and HOLDA signals, is valid for 
all the STRBs. 
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10.2 TMS320C32 Memory Overview 


The following sections describe examples, control register setups, and 
restrictions necessary to fully understand the operation and functionality of the 
external memory interface. 


10.2.1 External Memory Interface Overview 


The ’C32 memory interface accesses external memory through one 24-bit 
address and one 32-bit data bus that is shared by three mutually-exclusive 
strobes (STRBO, STRB1, and IOSTRB). Depending on the address accessed, 
the ‘C32 activates one of these strobes according to the memory map shown 
in Figure 4-3 on page 4-8. 


STRBO and STRB1 can access 8-, 16-, or 32-bit data from 8-, 16-, or 32-bit 
wide memory. This is accomplished by four signals in each strobe: 
STRBx_B3/A_1, STRBx_B2/A_o, STRBxB1, and STRBx_BO. These signals 
serve as byte-enable pins to access one byte, half word, or a full word from the 
external memory. The first two signals also serve as additional address pins 
to perform two or four consecutive accesses in 8-bit or 16-bit-wide external 
memory. The ’C32 controls the behavior of these pins through the data size 
and memory width bit fields in the corresponding strobe control register, as 
follows: 


_) Memory width (default value dependent on PRGW pin level) 
H  8-bit-wide memory 


= STRBx_B3/A_; and STRBx_B2/A_9 as address pins 
= STRBx_BO as byte-enable/chip-select signal 
= STRBx_B1 unused 


m 16-bit-wide memory 


= STRBx_B3/A_; as address pin 
= STRBx_B1 and STRBx_BO as byte-enable signal 
= STRBx_B2 unused 


B 32-bit-wide memory 


= STRBx_B3, STRBx_B2, STRBx_B1, and STRBx_BO as byte- 
enable signals 
_) Data size 


HM  8-bit data, physical address = logical address shift right by 2 
m 16-bit data, physical address = logical address shift right by 1 
M@ 32-bit data, physical address = logical address 
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IOSTRB can access 32-bit data from 32-bit wide memory. It does not have the 
flexibility of STRBO and STRB1 since it is composed of a single signal: 
IOSTRB. IOSTRB bus cycles are different from those of STRBO and STRB1 
and are discussed in Section 10.10. This timing difference accomodates 
slower I/O peripherals. 


The ’C32 memory interface parallel bus implements three mutually-exclusive 
address spaces distinguished via three separate control signals as shown in 
Figure 10-1. STRBO and STRB1 support 8-, 16-, or 32-bit data access in 8-, 
16-, 32-bit-wide external memory and 16-, 32-bit program access in 16-/32-bit- 
wide external memory. IOSTRB address space supports 32-bit data/program ac- 
cess in 32-bit-wide external memory. Internally, the ‘C32 has a 32-bit architecture, 
hence, the memory interface packs and unpacks the data accessed accordingly. 


Figure 10-1. Memory Address Spaces 


8-, 16-, 32-bit data in 
8-, 16-, 32-bit-wide memory 


Program in 16-, 32-bit-wide memory 
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8-, 16-, 32-bit data in 
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Program in 16-, 32-bit-wide memory 
Memory 
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control 
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32-bit data in 32-bit-wide 
IOSTRB memory 


Program in 32-bit-wide memory 


10.2.2 Program Memory Access 


The ’C32 supports program execution from 16- or 32-bit external memory 
width. The PRGW pin configures the width of the external program memory. 
When this pin is pulled high, the C32 executes from 16-bit wide memory. 
When this pin is pulled low, the C32 executes from 32-bit wide memory. For 
16-bit wide zero wait-state memory, the ‘C32 takes two instruction cycles to 
fetch a single 32-bit instruction. During the first cycle the lower 16 bits of the 
instruction are fetched. During the second cycle, the upper 16 bits are fetched 
and concatenated with the lower 16 bits. 32-bit memory fetches are identical 
to those of the ’C30 and ’C31. 
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The PRGW status bit field of the CPU status (ST) register reflects the setting 
of the PRGW pin. Figure 10-2 depicts all the bit fields of the CPU status (ST) 
register. 


Figure 10-2. Status aa 


31-16 + 14 1209 11 


' » PRGW INT 


R/IW R/W AW! R/W R/W AW! R/IW R/W R/W Rw R/W R/W R/W R/W 


Notes: 1) xx =reserved bit, read as 0 


The status of the PRGW pin also affects the reset value of the physical memory 
width bit fields of the STRBO and STRB1 bus-control registers. The physical 
memory width is set to 32-bit memory width if the PRGW pin is logic low after 
the device reset. The physical memory width is set to 16-bit memory width if 
the PRGW pin is logic high after the device reset (see Section 10.3 for more 
information). 


The cycle before and the cycle after changing the PRGW should not 
perform a program fetch over the external memory interface. 


10.2.3 Data Memory Access 


The ’C32 can load and store 8-, 16-, or 32-bit data quantities from and into 
memory. Because the CPU has a 32-bit architecture, the device internally 
handles all 8-, 16-, or 32-bit data quantities as a 32-bit value. Hence, the external 
memory interface handles the conversion between 8- and 16-bit data quantities 
to the internal 32-bit representation. The external memory interface also 
handles the storage of 32-, 16-, or 8-bit data quantities into 32-, 16-, or 8-bit wide 
memories. 


10.2.3.1 8-, 16-, or 32-Bit Integers Data Types 


The ’C32 supports 8-, 16- or 32-bit integer data quantities. When 8- or 16-bit 
integers are read from external memory, the value is loaded into the LSBs of 
the register with the MSBs sign-extended or zero-filled. The polarity of the sign 
ext/zero-fill bit field of the corresponding STRB control register controls the 
sign extension or zero fill (see paragraphs 10.3.1.1 and 10.3.1.2). The 32-bit 
integer data access is identical to that of the ‘C30 and ’C31. 
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10.2.3.2  16- or 32-Bit Floating-Point Data Types 
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The ’C32 supports 16- or 32-bit floating point data. For 16-bit floating-point 
reads, the eight MSBs are the signed exponent and the eight LSBs are the 
signed mantissa (see Section 5.3.2, ‘C32 Short Floating-Point Format for 
External 16-Bit Data, on page 5-6). When a 16-bit floating-point value is loaded 
into a 40-bit register, the external memory interface zero fills the least signifi- 
cant 24 bits of the register. When a 16-bit floating-point value is used as a 32-bit 
on-chip input operand, the external memory interface zero fills the 16 LSBs of 
the 32-bit input operand. The 32-bit floating-point data access is identical to 
that of the ‘C30 and ’C31. 


Configuration 


10.3 Configuration 


To access 8-, 16-, or 32-bit data (types) from 8-, 16-, or 32-bit wide memory, the 
memory interface of the C32 device uses either strobe STRBO or STRB1 with 
four pins each. These pins serve as byte-enable and/or additional-address pins. 
In conjunction with a shifted version of the internal address presented to the exter- 
nal address, the ‘C32 can select a single byte from one external memory location 
or combine up to four bytes from contiguous memory locations. The behavior of 
these pins is controlled by the external memory width and the data type size. The 
selected data size also determines the amount of internal-to-physical address 
shift. You can assign these values to the ‘C32 memory interface through bit fields 
in the bus control registers. 


10.3.1 External Interface Control Registers 


The following sections describe the bus control registers used to manipulate 
the byte addressability features of the ‘C32. Figure 10-3 shows the external 
interface control memory map. 


Figure 10-3. Memory-Mapped External Interface Control Registers 


Address Register 
808060h IOSTRB control 
808061h Reserved 
808062h Reserved 
808063h Reserved 
808064h STRBO control 
808065h Reserved 
808066h Reserved 
808067h Reserved 
808068h STRB1 control 
808069h Reserved 
80806Fh Reserved 
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10.3.1.1_ STRBO Control Register 


The STRBO control register (Figure 10-4) is a 32-bit register that contains the 
control bits for the portion of the external bus memory space that is mapped to 
STRBO. The following table lists the register bits with the bit names and functions. 
At the system reset, OF 10F8h is written to the STRBO control register if the PRGW 
pin is logic low and 0710F 8h is written to the STRBO control register if the PRGW 
pin is logic high. 


Figure 10-4. STRBO Control Register 


86.1 (BF 24 123 22 17 16 
switch | config | Zero fill width 
; R/W ! R/W R/W ' R/W 
15 13. toni G17 5 4 1 9 2 1 0 
XX ' BNKCMP | WTCNT SWW HIZ | NOHOLD | HOLDST 
' R/W : R/W R/AW R/W R/W R 


Notes: 1) R=read, W =write 
2) Xxx = reserved, read as 0 


10.3.1.2 STRB1 Control Register 


The STRB1 control register (Figure 10-5) is a 32-bit register that contains the 
control bits for the portion of the external bus memory space that is mapped 
to STRB1. Figure 10-5 shows the register bits with their names and functions. 
At system reset, OF 10F8h is written to the STRB1 control register if the PRGW 
pin is logic low and 0710F8h is written to the STRB1 control register if the 
PRGW pin is logic high. 


Figure 10-5. STRB1 Control Register 


31 24123 ai 20 (19 18 17 16/15 13 


Sign ext/ | ' Physical 


' ' UME mOry ' 
zero fill width : 
, RW R/W | RW RW | 


BNKCMP | 
' 


R/W 


Notes: 1) R=read, W =write 
2) xx = reserved, read as 0 
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Configuration 


The instruction immediately preceding a change in the data-size or 
memory-width bit fields should not perform a multicycle store. Do 
not follow a change in the data-size or memory-width bit fields with 


a store instruction. Also, do not perform a load in the next two 
instructions following a change in the data-size or memory-width 
bit fields 


10.3.1.3| IOSTRB Control Register 


The IOSTRB control register (Figure 10-6) is a 32-bit register that contains the 
control bits for the portion of the external bus memory space that is mapped to 
IOSTRB. Unlike the STRBO and STRB1, there is no byte-enable signal for the 
IOSTRB. The data access through the IOSTRB is always 32-bit. The following 
table lists the register bits with the bit names and functions. At the system reset, 
OF8h is written to the IOSTRB control register. The IOSTRB timing is identical 
to the ‘C30 IOSTRB timing. 


Figure 10-6. |OSTRB Control Register 


31 16115 12044 8 7 514 3 2 0 


XX y xX y XX WTCNT ‘ SWW 


: : R/W ' R/W 


‘ ‘ L 
Notes: 1) R=read, W =write 


2) xx = reserved, read as 0 


Note: 


After changing the bit fields of the IOSTRB control register, up to three 


instructions are fetched before the IOSTRB bus is reconfigured. 
| ee | 
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Table 10-1. 


Abbreviation 


HOLDST 


NOHOLD 


HIZ 


SWW 


WTCNT 


BNKCMP 


Data type size 
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Table 10-1 describes the bits in the STRBO, STRB1, and the IOSTRB control 


registers. 


STRBO, STRB1, and IOSTRB Control Register Bits 


Reset 
Value 


0 


11 


111 


10000 


11 


Name 


Hold status bit 


Port hold signal 


Internal hold 


Software wait mode 


Software wait mode 


Bank compare 


(STRBO and STRB1 
control registers only) 


Description 


This bit signals whether the port is being held (HOLDST = 1), 
or is not being held (HOLDST = 1). This status bit is valid 
whether the port has been held through hardware or soft- 
ware. (STRBO control register only) 


NOHOLD allows or disallows the port to be held by an exter- 
nal HOLD signal. When NOHOLD = 1, the ’C3x takes over 
the external bus and controls it, regardless of serviced or 
pending requests by external devices. No hold acknowledge 
(HOLDA) is asserted when a HOLD is received. However, it 
is asserted if an internal hold is generated (HIZ = 1). (STRBO 
control register only) 


When set (HIZ = 1), the port is put in hold mode. This is 
equivalent to the external HOLD signal. By forcing the high- 
impedance condition, the ’C3x can relinquish the external 
memory port through software. HOLDA goes low when the 
port is placed in the high impendance state. (STRBO control 
register only) 


In conjunction with WTCNT, this 2-bit field defines the mode 
of wait-state generation. 


This 3-bit field specifies the number of cycles to use when 
in the software wait mode for the generation of internal wait 
state. The range is 0 (WTCNT = 0 0 0) to 7 (WTCNT = 111) 
H1/H3 cycles. 


This 5-bit field specifies the number of MSBs of the address to 
be used to define the bank size. (STRBO and STRB1 control 
registers only) 


Indicates the size of the data type written in memory. 


Bit 17 Bit 16 Data Type Size 
0 0 8 bit 

0 1 16 bit 

1 0 Reserved 

1 1 32 bit 


Configuration 


Table 10-1. STRBO, STRB1, and IOSTRB Control Register Bits (Continued) 


Reset 
Abbreviation Value Name 
Physical 01 (STRBO and STRB1 
memory width or control registers only) 


11 


Description 


Indicates the size of the physical memory connected to the 
device. The “reset” value depends on the status of the 
PRGW pin. If the PRGW pin is logic low, the memory width 
is configured to 32 bits (= 119). If the PRGW pin is logic high, 
the physical memory width is configured to 16 bits (= 015). 
This field can have the following values: 


Bit17 Bit16 Data Type Size 


0 0 8 bit 

0 1 16 bit (reset value if PRGW = 1) 
1 0 Reserved 

il 1 32 bit (reset value if PRGW = 0) 


Setting the physical memory width field of the STRBO or 
STRB1 control registers changes the functionality of the 
STRBO or STRB1 signals. 


(When the physical memory width field is configured to 32 
bits, the corresponding STRBx_BO-STRBx_B3 signals 
are configured as byte-enable pins (see Figure 10-10 
on page 10-20). 


Ly When the physical memory width field is configured to 
16 bits, the corresponding STRBx_B3/A-1 signal is 
configured as an address pin while the STRBx_BO and 
STRBx_B1 signals are configured as byte-enable pins 
(see Figure 10-14 on page 10-26). 


J When the physical memory width field is configured to 
8 bits, the STRBx_B3/A-3 and STRBx_B2/A-2 signals 


are configured as byte-enable pins (see Figure 10-18 
on page 10-32). 


Once an STRBx_Bx signal is configured as_an address pin, 
it is active for any external memory access (STRBO, STRB1, 
IOSTRB, or external fetch). 
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Table 10-1. STRBO, STRB1, and IOSTRB Control Register Bits (Continued) 


Abbreviation 


Sign ext/ 
zero-fill 


STRB config 


STRB switch 


10-12 


Reset 
Value 


0 


0 


0 


Name 


(STRBO and STRB1 
control registers only) 


STRB configuration 


(STRBO control regis- 
ter only) 


Description 


Selects the method of converting 8- and 16-bit integer data 
into 32-bit integer data when transferring data from external 
memory to an internal register or memory location. This field 
can have the following values: 


Bit 20 Physical Memory Width 


0 8- or 16-bit integer reads are sign-extended to 
32 bits (reset value). 


1 The MSBs of 8- or 16-bit integer reads are zero- 
filled to make the number 32 bits. 


Activates the STRBO Bx signals when accessing data from 
STRBO or STRB1 memory spaces. This mode is useful 
when accessing a single external memory bank that stores 
two different data types, each mapped to a different STRB. 
This field can have the following values. 


Bit 21 Physical Memory Width 


0 STRBO_Bx signals are active for locations 
Oh-7FFFFFh and 880000h-8FFFFFh. 
STRB1_Bx signals are active for locations 
900000h-FFFFFFh (reset value). 


1 STRBO_Bx signals are active for locations 
Oh-7FFFFFh, 880000h-8FFFFFh and 
900000h-FFFFFFh. STRB1_Bx signals are 
active for locations 900000h-FFFFFFh. 


A functional representation of this configuration is shown in 
Figure 10-7 on page 10-13. 


Defines whether a single cycle is inserted between back-to- 
back reads when crossing STRBO or STRB1 to STRB1 to 
STRBO boundaries (switching STRBs). The extra cycle 
toggles the strobe signal during back-to-back reads. Other- 
wise, the strobe remains low during back-to-back reads. 
This field has the following values: 


Bit 22 Physical Memory Width 


0 Does not insert a single cycle between back- 
to-back reads that switch from STRBO to 
STRB1 or vice-versa (reset value). 


1 Inserts a single cycle between back-to-back 
reads that switch from STRBO to STRB1 or 
vice-versa (reset value). 


Configuration 


Figure 10-7. STRB Configuration 


STRBO_Bx 


STRBO_Bx 


STRB config 
STRB1_Bx 
STRB1_Bx 


10.3.2 Using Physical Memory Width and Data-Type Size Fields 


Consider a ’C32 connected to two banks of external memory. In this configura- 
tion, one bank is mapped to STRBO while the other bank is mapped to STRB1. 
The STRBO bank of memory is 32 bits wide and stores 32-bit data types. The 
STRB1 bank of memory is 16 bits wide and stores 16-bit data types. You can 
transfer these configurations to the ’C32 by setting the physical memory width 
and data-type size fields of the respective STRBO and STRB1 conirol registers. 
You also must clear the STRB config bit field to 0 since the banks are separate 
memories. Note that ‘C32 address pins As3A22Ae1...A;Ag are connected to the 
STRBO memory bank address pins Az3A22A91...A,Ag. But ‘C32 address pins 
AzeAo1...A{Ag A_1 are connected to the STRB1 memory-bank address pins 
Ao3A22Ao1 AY Ao. 


Executing the following code on this device results in the data-access 
sequence shown in Table 10-2. 


1) LDI 4000h, AR1 ; AR1 = 4000h 

2) LDI *AR1++, R2 ; R2 = *4000h and AR1 = AR1 + 1 

3) ADDI *AR1++, R2 ; R2 = R2 + *4001h and AR1 = AR1 + 1 
4) ADDI *AR1++, R2 ; R2 = R2 + *4002h and AR1 = AR1 + 1 
5) ADDI *AR1++, R2 ; R2 = R2 + *4003h and AR1 = AR1 + 1 
6) LDI 900h, AR2 ; AR2 = 900h 

7) LUSH 12, AR2 ; AR2 = 900000h 

8) LDI *AR2++, R3  ; R3 = *900000h and AR2 = AR2 + 1 

9) ADDI *AR2, R3 ; R3 = R3 + 900001h 
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By setting the bit fields of the STRBO bus control register with a physical- 
memory width of 32 bits and a data type size of 32 bits, the external address 
referring to the STRBO location is identical to the internal address used by the 
‘C32 CPU. Alternatively, setting the bit fields of the STRB1 bus control register 
with a physical memory width of 16-bit and a data-type size of 16-bit, the ad- 
dress presented by the ‘C32 external pins is the internal address shifted right 
by one bit with As3 driving As3 and Ago. Since the STRB1 memory-bank address 
pins Ap3AooAo1...A;Ag are connected to the ‘C32 address pins AgoAo,...A;A9A_1, 
the address seen by the STRB1 memory bank is identical to the ‘C32 CPU 
internal address. 


Table 10-2. Data-Access Sequence for a Memory Configuration with Two Banks 


Internal External Accessed 
Instruction # Address Bus Address Pins Active Strobe Byte Enable Data Pins 
(2) 4000h 4000h STRBO_B0/B1/B2/B3 D319 4000h 
(3) 4001h 4001h STRBO_B0/B1/B2/B3 D319 4001h 
(4) 4002h 4002h STRBO_B0/B1/B2/B3 D319 4002h 
(5) 4003h 4003h STRBO_B0/B1/B2/B3 D319 4003h 
(8) 900000h C80000h STRB1_B0/B1 and Di5-9 900000h 


STRB1_B3/A_j =0 
(9) 900001h C80001h STRB1_BO/B1 and Di5-9 900001h 


STRB1_B3/A_4 = 1 


The ability of the ‘C32 device to select a single byte from a single external 
memory location or combinations of bytes from several contiguous memory 
locations dictates that the internal address seen by the CPU correspond to a 
shifted version of the address presented to the external pins. The ‘C32 external 
memory interface handles this conversion automatically as long as you configure 
the bus control register to match the external memory configuration present in 
your hardware implementation. 


As seen in Figure 2-8 on page 2-20, ’C32 handles nine different memory 
access cases. The following sections discuss these cases in detail. 
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10.4 Programmable Wait States 


The ’C3x has its own internal software-configurable ready-generation capability 
for each strobe. This software wait-state generator is controlled by configuring 
two fields in the primary or expansion bus interface control registers. Use the 
WTCNT field to specify the number of software wait states to generate and use 
the SWW field to select one of the following four modes of wait-state generation: 


1) External RDY. Wait states are generated solely by the external RDY line 
ignoring software wait states. 

-) WTCNT-generated RDYwtcnt. Wait states generated solely by the software 
wait-state generator ignoring external RDY signals. 

1) Logical-AND of RDY and RDYwitcnt. Wait states generated with a logical 
AND of internal and external ready signals. Both signals must occur. 

1) Logical-OR of RDY and RDYwtcnt. Wait states are generated with a logical 
OR of internal and external ready signals. Either signal can generate ready. 


The four modes are used to generate the internal ready signal, RDY jp}, that 
controls accesses. As long as RDYjn; = 1, the current external access is 
delayed. When RDYjp; = 0, the current access completes. Since the use of 
programmable wait states for both external interfaces is identical, only the 
primary bus interface is described in the following paragraphs. 


RDYwtent is an internally generated ready signal. When an external access is 
begun, the value in WTCNT is loaded into a counter. WTCNT can be any value 
from 0 through 7. The counter is decremented every H1/H3 clock cycle until 
it becomes 0. Once the counter is set to 0, it remains set to 0 until the next 
access. While the counter is nonzero, RDYient = 1. While the counter is 0, 
RDY wtent = 0. 


Table 10-3 shows the truth table for each value of SWW and the different 
combinations of RDY, RDYwicnt, and RDY int. 


EE 
Note: 


At reset, the ‘C3x is programmed with seven wait states for each external 
memory access. These wait states are inserted to ensure the system can func- 
tion with slow memories. To maximize system performance when accessing 
external memories, you need to decrease the number of wait states. 


After changing wait states, up to three instructions will be fetched before the 


change in the wait-state occurs. 
ee | 
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Table 10-3. Wait-State Generation 


Inputs Output 
SWW Bit 
Field /RDYext /RDYwtcnt /RDYint Functional Description 
00 0 X 0 Wait until external RDY is signaled 
1 X 1 
01 x 0 0 Wait until internal wait state generator counts 
- 1 1 down to 0 
10 0 0 0 Wait until first signal: external RDY or the 
{ 0 internal wait state generator (logical OR) 
1 0 0 
1 1 1 
11 0 0 0 Wait until both external RDY is signaled and 
{ 1 wait state generator counts down to 0 (logical 
AND) 
1 0 1 
1 1 1 
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10.5 Programmable Bank Switching 


Programmable bank switching allows you to switch between external memory 
banks without having to insert wait states externally due to memories that require 
several cycles to turn off. Bank switching is implemented on STRBO and STRB1 
only. 


The size of a bank is determined by the number of bits specified to be examined 
on the BNKCMP field of the primary bus control register. For example, if 
BNKCMP = 16, the 16 MSBs of the address are used to define a bank (see 
Figure 9-4). Since addresses are 24 bits, the bank size is specified by the eight 
LSBs, yielding a bank size of 256 words. If BNKCMP = 16, only the 16 MSBs 
are compared. Bank sizes from 28 = 256 to 224 = 16M are allowed. Table 9-6 
summarizes the relationship between BNKCMP, the address bits used to define 
a bank, and the resulting bank size. 


Figure 10-8. BNKCMP Example 


\4 24-bit address >| 


23 al 7 0 


Po 


——— Number of bits to compare pla Defines bank size —) 
Table 10-4. BNKCMP and Bank Size 


BNKCMP MSBs Defining a Bank Bank Size (32-Bit Words) 
00000 None 224 = 16M 
00001 23 223 = 8M 
00010 23-22 222 = 4M 
00011 23-21 221 = 2M 
00100 23-20 220 - 1M 
00101 23-19 219 = 512K 
00110 23-18 218 — 256K 
00111 23-17 217 = 128K 
01000 23-16 216 = 64K 
01001 23-15 215 = 32K 
01010 23-14 214 = 16K 
01011 23-13 213 = 8K 
01100 23-12 212 ~ 4K 
01101 23-11 211 = 2K 
01110 23-10 210 = 1K 
01111 23-9 29 = 512 
10000 23-8 28 = 256 
10001-11111 Reserved Undefined 
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The ’C3x has an internal register that contains the MSBs (as defined by the 
BNKCMP field) of the last address used for a read or write over the primary inter- 
face. At reset, the register bits are set to 0. If the MSBs of the address being used 
for the current primary interface read do not match those contained in this inter- 
nal register, a read cycle is not asserted for one H1/H3 clock cycle. During this 
extra clock cycle, the address bus switches over to the new address, but STRB 
is inactive (high). The contents of the internal register are replaced with the 
MSBs being used for the current read of the current address. If the MSBs of the 
address being used for the current read match the bits in the register, a normal 
read cycle takes place. 


If repeated reads are performed from the same memory bank, no extra cycles 
are inserted. When a read is performed from a different memory bank, memory 
conflicts are avoided by the insertion of an extra cycle. This feature can be 
disabled by setting BNKCMP to 0. The insertion of the extra cycle occurs only 
when a read is performed. The changing of the MSBs in the internal register 
occurs for all reads and writes over the primary interface. 


Figure 9-5 shows the addition of an inactive cycle when switches between 
banks of memory occur. 


Figure 10-9. Bank-Switching Example 


R/W 
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ccc ccc | 
Note: 


After changing BNKCMP, up to three instructions are fetched before the 


change in bank size occurs. 
ee | 
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10.6 32-Bit-Wide Memory Interface 


The ’C32 memory interface to 32-bit-wide external memory uses STRBx_B3 
through STRBx_BO pins as strobe-byte-enable pins as shown in Figure 10-10. 
In this manner, the C32 can read from, or write to, a single 32-, 16-, or 8-bit value 
from the external 32-bit-wide memory. 


Figure 10-10. TMS320C32 External Memory Interface for 32-Bit SRAMs 


STRBx_B3 


STRBx_B2 
STRBx_B1 
STRBx_BO 


D(31-24) 
D(23-16) 
D(15-8) 
D(7-0) 


/0(7-0) /0(7-0) /0(7-0) /0(7-0) 


Case 1: 32-Bit-Wide Memory With 8-Bit Data-Type Size 
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When the data-type size is 8 bits, the ‘C32 shifts the internal address two bits 
to the right before presenting it to the external-address pins. In this shift, the 
memory interface copies the value of the internal-address Ao3 to the external- 
address pins Ao3, Aso, and Az}. The memory interface activates the STRBx_B3 
through STRBx_BO pins according to the value of the internal address bits A, 
and Ay as shown in Table 10-5. Figure 10-11 shows a functional diagram of 
the memory interface for 32-bit-wide memory with an 8-bit data-type size. 


32-Bit-Wide Memory Interface 


Table 10-5. Strobe Byte-Enable for 32-Bit-Wide Memory With 8-Bit Data-Type Size 


Internal Internal Active Strobe 
Ay Ao Byte Enable 
0 0 STRBx_BO 
0 1 STRBx_B1 
1 0 STRBx_B2 
1 1 STRBx_B3 


Figure 10-11. Functional Diagram for 8-Bit Data-Type Size and 32-Bit External-Memory 
Width 


Memory interface 


/0(7-0) /0(7-0) 


STRBx_B3 —~ 
STRBx_B2 
STRBx_B1 
STRBx_BO 
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For example, reading from or writing to memory locations 904000h to 
90 4004h involves the pins listed in Table 10-6. 


Table 10-6. Example of 8-Bit Data-Type Size 


Internal External Active Strobe Accessed 


Address Bus Address Pins Byte Enable Data Pins 
904000h E41000h STRB1_BO D7_9 
904001h E41000h STRB1_B1 Di5_8 
904002h E41000h STRB1_B2 Do3_16 
904003h E41000h STRB1_B3 D31_24 
904004h E41001h STRB1_BO D7_9 


Case 2: 32-Bit-Wide Memory With 16-Bit Data-Type Size 


When the data-type size is 16 bits, the ‘C32 shifts the internal address one bit 
to the right before presenting it to the external-address pins. In this shift, the 
memory interface copies the value of the internal-address Ao3 to the external- 
address pins As3 and Ass. Also, the memory interface activates the STRBX-B3 
through STRBx_BO pins according to the value of the internal address bit Ap 
as shown in Table 10-7. Figure 10-12 shows a functional diagram of the 
memory interface for 32-bit-wide memory with 16-bit data-type size. 


Table 10-7. Strobe Byte-Enable for 32-Bit-Wide Memory With 16-Bit Data-Type Size 


Internal Ap Active Strobe Byte Enable 
0 STRBx_B1 and STRBx_BO 
1 STRBx_B3 and STRBx_B2 
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Figure 10-12. Functional Diagram for 16-Bit Data-Type Size and 32-Bit External-Memory 
Width 


C32 


Memory interface 


3 A283 A23 
hoo A22 
2 AZ1— Aat 
5 A20—- A20 
Ss Aig 
o . . 
go. . 

mo Ag ; 
o Ai AY 
O Ao Ao 


STRBx_B3 — 


STRBx_B2 


\ 


STRBx_B1 


STRBx_BO 


AAAA 


For example, reading or writing to memory locations 904000h to 904004h 
involves the pins listed in Table 10-8. 


Table 10-8. Example of 16-Bit Data-Type Size and 32-Bit-Wide External Memory 


Internal External 


Address Bus Address Pins 


904000h 
904001h 
904002h 
904003h 
904004h 


C82000h 
C82000h 
C82001h 
C82001h 
C82002h 


Active Strobe Byte Enable 


STRB1_B1 and STRB1_BO 
STRB1_B3 and STRB1_B2 
STRB1_B1 and STRB1_BO 
STRB1_B3 and STRB1_B2 
STRB1_B1 and STRB1_BO 


Accessed 
Data Pins 


D150 
D31-16 
D15-0 
D31_16 


D15_0 
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Case 3: 32-Bit-Wide Memory With 32-Bit Data-Type Size 


When the data size is 32 bits, the ‘C32 does not shift the internal address 
before presenting it to the external address pins. In this case, the memory 
interface copies the value of the internal address bus to the respective external- 
address pins. Also, the memory interface activates STRBx_B3 through 
STRBx_BO pins during accesses. Figure 10-13 shows a functional diagram 
of the memory interface for 32-bit-wide memory with 32-bit data size. 


Figure 10-13. Functional Diagram for 32-Bit Data Size and 32-Bit External-Memory Width 
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32-Bit-Wide Memory Interface 


For example, reading or writing to memory locations 904000h to 904004h 
involves the pins listed in Table 10-9. 


Table 10-9. Example of 32-Bit-Wide Memory With 32-Bit Data-Type Size 


Internal External Accessed 
Address Bus’ Address Pins Active Strobe Byte Enable Data Pins 
904000h 904000h STRB1_BO, STRB1_B1, D31_9 


STRB1_B2, and STRB1_B3 


904001h 904001h STRB1_BO, STRB1_B1, D31_0 
STRB1_B2, and STRB1_B3 


904002h 904002h STRB1_BO, STRB1_B1, D31_0 
STRB1_B2, and STRB1_B3 


904003h 904003h STRB1_BO, STRB1_B1, D31_9 
STRB1_B2, and STRB1_B3 


904004h 904004h STRB1_BO, STRB1_B1, D31_9 
STRB1_B2, and STRB1_B3 


TMS320C32 Enhanced External Memory Interface 10-25 


16-Bit-Wide Memory Interface 


10.7 16-Bit-Wide Memory Interface 


The ’C32 memory interface to 16-bit-wide external memory uses STRBx_B3 pin 
as an additional address pin, A_;, while using STRBx_BO and STRBx_B1 as 
strobe byte-enable pins as shown in Figure 10-14. Note that the external- 
memory address pins are connected to the C32 address pins AzsApo1...A;AgA_1. 
In this manner, the ’C32 can read/write a single 32-, 16-, or 8-bit value from the 
external 16-bit-wide memory. 


Figure 10-14. External-Memory Interface for 16-Bit SRAMs 
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Case 4: 16-Bit-Wide Memory With 8-Bit Data-Type Size 
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When the data type size is 8 bits, the ‘C32 shifts the internal address two bits 
to the right before presenting it to the external-address pins. In this shift, the 
memory interface copies the value of the internal-address Ao; to the external- 
address pins Ao3, Ago, and Ao;. The memory interface also copies the value 
of the internal-address A, to the external STRBx_B3/A_, pin. Furthermore, the 
memory interface activates the STRBx_B1 and STRBx_BO pins according to 
the value of the internal address bit Ap as shown in Table 10-10. Figure 10-15 
shows a functional diagram of the memory interface for 16-bit-wide memory 
with 8-bit data-type size. 


16-Bit-Wide Memory Interface 


Table 10-10. Strobe-Byte Enable Behavior for 16-Bit-Wide Memory with 8-Bit Data-Type Size 


Figure 10-15. 


"C32’s core address bus 
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Functional Diagram for 8-Bit Data-Type Size and 16-Bit External-Memory Width 
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For example, reading or writing to memory locations 4000h to 4004h involves 
the pins listed in Table 10-11. 
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Table 10-11. Example of 8-Bit Data-Type Size and 16-Bit-Wide External Memory 


Internal 
Address Bus 


4000h 
4001h 
4002h 
4003h 
4004h 


External 


Address Pins 


1000h 
1000h 
1000h 
1000h 
1001h 


0 
0 
1 
1 
0 


Accessed 
STRBO_B3/A_ 


Active Strobe Byte Enable Data Pins 
STRBO_BO D7_9 
STRBO_B1 Di5-8 
STRBO_BO D7_9 
STRBO_B1 Di5-8 
STRBO_BO D7_0 


Case 5: 16-Bit-Wide Memory With 16-Bit Data-Type Size 
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When the data-type size is 16 bits, the ’C32 shifts the internal address one bit 
to the right before presenting it to the external address pins. In this shift, the 
memory interface copies the value of the internal-address Ao3 to the external- 
address pins As3 and Ago. Also, the memory interface copies the value of the 
internal-address A, to the external STRBx_B3/A_, pin. Moreover, the memory 
interface activates the STRBx_B1 and STRBx_BO during accesses. 
Figure 10-16 shows a functional diagram of the memory interface for 16-bit- 
wide memory with 16-bit data-type size. 


16-Bit-Wide Memory Interface 


Figure 10-16. Functional Diagram for 16-Bit Data-Type Size and 16-Bit External-Memory 
Width 
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For example, reading or writing to memory locations 4000h to 4004h involves 
the pins listed in Table 10-12. 


Table 10-12. Example of 16-Bit-Wide Memory With 16-Bit Data-Type Size 


Internal 
Address Bus 


4000h 
4001h 
4002h 
4003h 
4004h 


External 


Address Pins 


2000h 
2000h 
2001h 
2001h 
2002h 


STRBO_B3/A_4 


0 
1 
0 


STRBO_BO and STRBO_B1 
STRBO_BO and STRBO_B1 
STRBO_BO and STRBO_B1 
STRBO_BO and STRBO_B1 


Active Strobe Byte Enable 
STRBO_BO and STRBO_B1 


Accessed 
Data Pins 


D15-0 
D150 
D150 
D150 


D15-0 
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Case 6: 16-Bit-Wide Memory with 32-Bit Data-Type Size 


When the data type size is 32 bits, the ‘C32 does not shift the internal address 
before presenting it to the external address pins. In this case, the memory 
interface copies the value of the internal address bus to the respective external 
address pins. The memory interface also toggles STRBx_B3/A_, twice to 
perform two 16-bit memory accesses. In the consecutive memory accesses, 
the memory interface activates STRBx_B1 and STRBx_BO. In summary, the 
memory interface seems to add one wait state to the 32-bit data access. 
Figure 10-17 depicts a functional diagram of the memory interface for 16-bit 
wide memory with 32-bit data type size. 


Figure 10-17. Functional Diagram for 32-Bit Data-Type Size and 16-Bit External-Memory 
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For example, reading or writing to memory locations 4000h to 4004h involves 
the pins listed in Table 10-13. 


16-Bit-Wide Memory Interface 


Table 10-13. Example of 16-Bit-Wide Memory With 32-Bit Data-Type Size 


Internal External Accessed 


Address Bus AddressPins §STRBO_B3/A_, Active Strobe Byte Enable Data Pins 
4000h 4000h 0 STRBO_BO and STRBO_B1 Di5-0 
4000h 1 STRBO_BO and STRBO_B1 Di5-0 
4001h 4001h 0 STRBO_BO and STRBO_B1 D45_0 
4001h 1 STRBO_BO and STRBO_B1 D45_0 
4002h 4002h 0 STRBO_BO and STRBO_B1 D45_0 
4002h 1 STRBO_BO and STRBO_B1 D45_0 
4003h 4003h 0 STRBO_BO and STRBO_Bt Di5_0 
4003h 1 STRBO_BO and STRBO_Bt Di5_0 
4004h 4004h 0 STRBO_BO and STRBO_B1 D45_0 
4004h 1 STRBO_BO and STRBO_B1 D15-0 
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10.8 8-Bit-Wide Memory Interface 


Figure 10-18. 


’C32 memory interface to an 8-bit wide external memory uses STRBx_B3 and 
STRBx_B2 pins as additional address pins, A_y and A_», respectively, while 
using STRBx_BO as strobe byte-enable pin as shown in Figure 10-18. The 
external-memory address pins are connected to the ’C32’s address pins 
Ao1Ago..-AyAgA_;A_oa. In this manner, the ’C32 can read/write a single 32-, 16-, 
or 8-bit value from the external 8-bit-wide memory. 


External Memory Interface for 8-Bit SRAMs 
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Case 7: 8-Bit-Wide Memory With 8-Bit Data-Type Size 
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Similarly to case 4, the ‘C32 shifts the internal address two bits to the right 
before presenting it to the external-address pins when the data type is 8-bit. As 
in case 4, the memory interface copies the value of the internal-address Ag: to 
the external-address pins Ao3, Aso, and As. But in case 7, the memory interface 
also copies the value of the internal-address A, to the external STRBx_B3/A_ 
pin and the value of Ag to the external STRBx_B2/A_». Moreover, the memory 
interface only activates the STRBx_BO pin during the external memory access. 
Figure 10-19 shows a functional diagram of the memory interface for 8-bit-wide 
memory with an 8-bit data-type size. 


8-Bit-Wide Memory Interface 


Figure 10-19. Functional Diagram for 8-Bit Data-Type Size and 8-Bit External-Memory 
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For example, reading or writing to memory locations A04000h to A04004h 
involves the pins listed in Table 10-14. 


Table 10-14. Example of 8-Bit-Wide Memory With 8-Bit Data-Type Size 


Internal 
Address Bus 


A04000h 
A04001h 
A04002h 
A04003h 
A04004h 


External 
Address Pins 


E81000h 
E81000h 
E81000h 
E81000h 
E81001h 


Active Strobe 


STRBO_B3/A_; STRBO_B3/A_p Byte Enable 


0 0 STRB1_BO 
0 1 STRB1_BO 
1 0 STRB1_BO 
1 1 STRB1_BO 
0 0 STRB1_BO 


Accessed 
Data Pins 


D7_9 
D7_9 
D7_9 
D7_9 
D7_9 
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Case 8: 8-Bit Wide Memory With 16-Bit Data-Type Size 


When the data-type size is 16 bits, the ‘C32 shifts the internal address one bit 
to the right before presenting it to the external-address pins. In this shift, the 
memory interface copies the value of the internal-address Ao3 to the external- 
address pins Ao3 and Ass. Also, the memory interface copies the value of the 
internal-address Ag to the external STRBx_B3/A_; pin. Furthermore, the 
memory interface toggles STRBx_B2/A_» twice to perform two 8-bit memory 
accesses. Moreover, the memory interface activates the STRBx_BO during 
accesses. In summary, the memory interface adds one wait state to the 16-bit 
data access. Figure 10-20 shows a functional diagram of the memory inter- 
face for 8-bit-wide memory with 16-bit data-type size. 


Figure 10-20. Functional Diagram for 16-Bit Data-Type Size and 8-Bit External-Memory 
Width 
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8-Bit-Wide Memory Interface 


For example, reading or writing to memory locations A04000h to A04002h 
involves the pins listed in Table 10-15. 


Table 10-15. Example of 8-Bit-Wide Memory With 16-Bit Data-Type Size 


Internal External === ——— Active Strobe Accessed 
Address Bus Address Pins °'RB0_B3/A_, STRBO_B3/A_> Byte Enable Data Pins 
A04000h DO02000h 0 0 STRB1_BO D7_9 
DO2000h 0 1 STRB1_BO D7_o 
A04001h DO02001h 0 STRB1_BO D7_9 
DO02001h 1 STRB1_BO D7_9 
A04002h D02002h 0 0 STRB1_BO D7_9 
D02002h 0 1 STRB1_BO D7_9 


Case 9: 8-Bit-Wide Memory With 32-Bit Data-Type Size 


When the data-type size is 32 bits, the ‘C32 does not shift the internal address 
before presenting it to the external-address pins. In this case, the memory 
interface copies the value of the internal-address bus to the respective external- 
address pins. The memory interface also toggles STRBx_B3/A_; and 
STRBx_B2/A_» to perform four 8-bit memory accesses. In the consecutive 
memory accesses, the memory interface activates STRBx_BO. In summary, 
the memory interface adds three wait states to the 32-bit data access. 
Figure 10-21 shows a functional diagram of the memory interface for 8-bit- 
wide memory with 32-bit data-type size. 
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Figure 10-21. Functional Diagram for 32-Bit Data-Type Size and 8-Bit External-Memory 
Width 
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8-Bit-Wide Memory Interface 
For example, reading or writing to memory locations A04000h to A04001h 
involves the pins listed in Table 10-16. 


Table 10-16. Example of 32-Bit Data-Type Size and 8-Bit-Wide Memory 


Internal External =e —_ Active Strobe Accessed 


Address Bus AddressPins °!RBO_B3/A_, = STRBO_B3/A_p Byte Enable Data Pins 

A04000h A04000h 0 0 STRB1_BO D7_9 
A04000h 0 1 STRB1_BO D7_9 
A04000h 1 0 STRB1_BO D7_9 
A04000h 1 1 STRB1_BO D7_9 

A04001h A04001h 0 0 STRB1_BO D7_9 
A04001h 0 1 STRB1_BO D7_9 
A04001h 1 0 STRB1_BO D7_9 
A04001h 1 1 STRB1_BO D7_9 
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10.9 External Ready Timing Improvement 


The ready (RDY) timing should relate to the H1 low signal as shown in 
Figure 10-22. This is equivalent to the ’C4x ready timing, which increases the 
time between valid address and the sampling of RDY. This facilitates the memory 
hardware interface by allowing a longer address decode-circuit response time to 
generate a ready signal. 


Figure 10-22. RDY Timing for Memory Read 
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10.10 Bus Timing 


This section discusses functional timing of operations on the external memory 
bus. Detailed timing specifications are contained in the TMS320C32 Data 
Sheet. The timing of STRBO and STRB1 bus cycles is identical and discussed 
in subsection 10.10.1. The abbreviation STRBx is used in references that per- 
tain equally to STRBO and STRB1. The IOSTRB bus cycles are timed differently 
and are discussed in subsection 10.10.2. 


10.10.1 STRBO and STRB1 Bus Cycles 


All bus cycles comprise integral numbers of H1 clock cycles. One H1 cycle is 
defined from one falling edge of H1 to the next falling edge of H1. For full speed 
(zero wait-state) accesses on STRBO and STRB1, writes take two H1 cycles 
and reads take one cycle. However, if the read immediately follows a write, the 
read takes two cycles. Writes to internal memory take one cycle if no other 
accesses to that interface are in progress. The following discussion pertains 
to zero wait-state accesses, unless otherwise specified. 


The STRBx signal is low for the active portion of both reads and writes (one 
H1 cycle). Additionally, before and after the active portions of writes only 
(STRBx low), there is a transition of one H1 cycle. During this transition cycle 
the following might occur: 


J STRBx is high. 


C) If required, R/W changes state on the rising edge of H1. 


_j If required, address changes on the rising edge of H1 if the previous H1 
cycle performed a write. If the previous H1 cycle performed a read, 
address changes on the falling edge of H1. 


Figure 10-23 illustrates a zero wait-state read-read-write sequence for STRBx 
active. The data is read as late in the cycle as possible to allow for the maximum 
access time from address valid. Although external writes take two cycles, writes 
to internal memory take one cycle if no other accesses to that interface are in 
progress. Similar to typical external interfaces, the R/W signal does not change 
until STRBO and STRB1 are deactivated. 
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Figure 10-23. Read-Read-Write Sequence for STRBx Active 
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STRBx 


R/IW 


RDY 


Figure 10-24 shows a zero wait-state write-write-read sequence for STRBx 
active. During back-to-back writes, the data is valid when STRBx changes for the 
first write, but for subsequent writes the data is valid when the address changes. 


Figure 10-24. Write-Write-Read Sequence for STRBx Active 
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Figure 10-25 shows a one wait-state read sequence and Figure 10-26 shows 
the write sequence for STRBx active. On the first H1 cycle, RDY is high; therefore, 
the read or write sequence is extended for one extra cycle. On the second H1 
cycle, RDY is low and the read or write sequence is terminated. 


Figure 10-25. One Wait-State Read Sequence for STRBx Active 
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Figure 10-26. One Wait-State Write Sequence for STRBx Active 


H3 
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10.10.2 IOSTRB Bus Cycles 


In contrast to STRBO and STRB1 bus cycles, IOSTRB full speed (zero wait- 
state) reads and writes consume two H1 cycles. During these cycles, the 
IOSTRB signal is low from the rising edge of the first H1 cycle to the rising edge 
of the second H1 cycle. Also, the address changes on the falling edge of the 
first H1 cycle and R/W changes state on the falling edge of H1. This provides 
a valid address to peripherals that may change their status bits when read or 
written while IOSTRB is active. Moreover, the IOSTRB signal is high between 
IOSTRB read and write cycles. 
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Figure 10-27 illustrates a zero wait-state read and write sequence for IOSTRB 
active. During writes, the data is valid when IOSTRB changes. 


Figure 10-27. Zero Wait-State Read and Write Sequence for IOSTRB Active 
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Figure 10-28 depicts a one wait-state read sequence for IOSTRB active. 
Figure 10-29 shows a one wait-state write sequence for IOSTRB active. For 
each wait-state added, IOSTRB, R/W, and A are extended for one extra clock 
cycle. Writes hold the data on the bus for one extra clock cycle. RDY is sampled 
on each extra cycle and the sequence is terminated when RDY is low. 
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Figure 10-28. One Wait-State Read Sequence for IOSTRB Active 
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Figure 10-29. One Wait-State Write Sequence for IOSTRB Active 
H3 
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Figure 10-30 and Figure 10-31 illustrate the transitions between STRBx 
reads and IOSTRB writes and reads, respectively. In these transitions, the ad- 
dress changes on the falling edge of the H1 cycle. 
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Figure 10-30. STRBx Read and lIOSTRB Write 
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Figure 10-31. STRBx Read and IOSTRB Read 
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Figure 10-32 and Figure 10-33 illustrate the transitions between STRBx 
writes and IOSTRB writes and reads, respectively. In these transitions, the 
address changes on the falling edge of the H3 cycle. 


Figure 10-32. STRBx Write and IOSTRB Write 
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Figure 10-33. STRBx Write and IOSTRB Read 
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Figure 10-34 through Figure 10-37 show the transitions between IOSTRB 
writes/reads and STRBx writes/reads. In these transitions, the address 
changes on the rising edge of the H3 cycle. 


Figure 10-34. IOSTRB Write and STRBx Write 
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Figure 10-35. IOSTRB Write and STRBx Read 
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Figure 10-36. IOSTRB Read and STRBx Write 
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Figure 10-37. IOSTRB Read and STRBx Read 
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Figure 10-38 through Figure 10-40 illustrate the transitions between reads 
and writes. 
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Figure 10-38. IOSTRB Write and Read 
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Figure 10-39. IOSTRB Write and Write 
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Figure 10-40. IOSTRB Read and Read 
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10.10.3 Inactive Bus States 


Figure 10-41 and Figure 10—42 show the signal states when a bus becomes 
inactive after an IOSTRB or STRBx, respectively. The strobes (STRBO, 
STRB1, IOSTRB, and R/W) are deasserted going to a high level. The address 
bus preserves the last value and the ready signal (RDY) is ignored. 


Figure 10-41. Inactive Bus States Following IOSTRB Bus Cycle 
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Figure 10-42. Inactive Bus States Following STRBx Bus Cycle 
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Chapter 11 


Using the TMS320C31/VC33 and 
TMS320C32 Boot Loaders 


The 'C31/VC33 and ’C32 have on-chip boot loaders that can load and execute 
programs received from a host processor, standard memory devices (includ- 
ing EPROM), or via serial port. 
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TMS320C31/VC33 Boot Loader 


11.1 TMS320C31/VC33 Boot Loader 


This section describes how to use the ’'C31/VC33 microcomputer/boot loader 
(MCBL/MP) function. This feature is unique to the ‘C31/VC33 and ’C32, and is 
not available on the ’C30 devices. 


11.1.1 TMS320C31/VC33 Boot-Loader Description 


The boot loader lets you load and execute programs that are received from a 
host processor, inexpensive EPROMs, or other standard memory devices. The 
programs to be loaded reside in one of three memory-mapped areas identified 
as Boot 1, Boot 2, and Boot 3 (see the shaded areas of Figure 4-2 on page 4-6), 
or they are received by means of the serial port. 


The boot loader supports user-definable byte, half-word, and word-data formats, 
as well as 32-bit fixed-burst loads from the ’C31/VC33 serial port. See Section 
12.2, Serial Ports, on page 12-15 for a detailed description of the serial-port op- 
eration. 


The boot-loader code starts at location 0x45 in the on-chip ROM. The source 
code is supplied in Appendix B. 


11.1.2 TMS320C31/VC33 Boot-Loader Mode Selection 


The ’C31/VC33 boot loader functions as a memory boot loader or a serial-port 
boot loader. The boot-loader function is selected by resetting the processor 
while driving the MCBL/MP pin high. Use interrupt pins INT3 — INTO to select 
the boot-load operation. Figure 11-1 shows the flow of this operation, which de- 
pends on the mode selected (external memory or serial boot). 


(1 The memory boot loader supports user-definable byte, half-word, and full- 
word data formats, allowing the flexibility to load a source program from 
memories having widths of 8-, 16-, or 32 bits. The source program must 
reside in one of three memory locations as listed in Table 11-1. 
Figure 11-2 shows the memory boot-loader flow. 


(1 The serial-port boot loader supports 32-bit fixed-burst transfers, with 
externally generated serial-port clock and frame-sync signals. The format 
of the incoming data stream is similar to that of the memory boot loader, 
except the source memory width and memory configuration word are 
omitted. Figure 11-3 shows the serial-port boot-loader flow. 
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Table 11-1. Boot-Loader Mode Selection 


INTO INT? INT2 INT3_ Loader Mode Memory Addresses 
1 1 1 External memory Boot 1 address 0x001000 
1 0 1 1 External memory Boot 2 address 0x400000 
1 1 0 1 External memory Boot 3 address OxFFFO0O 
1 1 1 0 32-bit serial Serial port 0 


Figure 11-1. TMS320C31/VC33 Boot-Loader Mode-Selection Flowchart 
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11.1.3 TMS320C31/VC33 Boot-Loading Sequence 


The following is the sequence of events that occur during the boot load of a 
source program. Table 11-2 shows the structure of the source program. 


1) 


Select the boot loader by resetting the ‘C31/VC33 while driving the 
MCBL/MP pin high and the corresponding INT3-INTO pin low. The 
MCBL/MP must stay high during boot loading, but can be changed any- 
time after boot loading has terminated. No reset is necessary when chang- 
ing the INT3-INTO pin, as long as the ’C31/VC33 is not accessing the 
overlapping memory (Oh—FFFh) during this transition (see Section 
11.1.6). The INT3-INTO pin can be driven low anytime after deasserting 
the RESET pin (driven low and then high). 


The status of the interrupt flag (IF) register’s INT3—INTO bit fields dictate 
the boot-loading mode. The bits are polled in the order described in the 
flow chart in Figure 11-1. 


If only the IF register’s INT3 bit field is set, the boot loader configures the 
serial port for 32-bit fixed-burst mode reads with an externally generated 
serial-port clock and FSR. Then, it proceeds to boot load the source pro- 
gram from the serial port. The transferred data-bit order supplied to the 
serial port must begin with the most significant bit (MSB) and end with the 
least significant bit (LSB). Figure 11-3 depicts the boot-loader serial-port 
flow. 


Otherwise, the boot loader attempts a memory boot load. Figure 11-2 shows 
the boot-loader memory flow. If the IF register’s INTO bit field is set, the 
source program is loaded from memory location 1000h. If the IF register’s 
INT1 bit field is set, the source program is loaded from memory location 
400000h. If the IF register’s INT2 bit field is set, the source program is loaded 
from memory location FFFOOOh. 


The memory boot-load source program has a header indicating the boot 
memory width and memory configuration control word. This word is copied 
into the STRB control register to configure the external primary bus interface. 


After reading the header, the boot loader copies the source-program blocks. 
The source-program blocks have two entries preceding the source-program- 
block data. The first entry in the source-program block indicates the size of 
the block. A block size of zero signals the end of the source program code. 
The second entry indicates the address where the block is to be loaded. The 
boot loader cannot load the source program to any memory address below 
1000h, unless the address decode logic is remapped. 


The boot loader branches to the destination address of the first source 
block loaded and begins program execution. 


Figure 11-2.Boot-Loader Memory-Load Flowchart 
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Figure 11—3.Boot-Loader Serial-Port Load-Mode Flowchart 
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11.1.4 TMS320C31/VC33 Boot Data Stream Structure 


Table 11-2 shows the data stream structure. The data stream is composed of 
a header of 1 (serial-port load) or 2 (memory load) words and one or more blocks 
of source data. The boot loader uses this header to determine the physical 
memory width where the source program resides (memory load) and to configure 
the primary bus interface before source program boot load. The blocks of 
source data have two entries in addition to the raw data. The first entry in this 
block indicates the size of the block. The second entry in this block indicates the 
memory address where the boot loader copies this source block. Words 5 
through n of the shaded entries in Table 11-2 contain the source data for the first 
block. 
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Table 11-2. Source Data Stream Structure 


Wordt Content Valid Data Entries 
1 Memory width (8, 16, or 32 bits) where source program resides 8h, 10h, or 20h, respectively 
2 Value to set the STRB control register See subsection 10.7 


3 Size of first data block. The block size is the number of 32-bit 0<size < 224 
words in the data block. A 0 in this entry signifies the end of the 
source data stream 
4 Destination address to load the first block A valid °C31/VC33 24-bit address 


5 First word of first block A’C31/VC33 valid instruction or any 
32-bit wide data value (LSB first) 


n Last word of first block A ’C31/VC33 valid instruction or any 
32-bit wide data value 


m Size of last data block. The block size is the number of 32-bit 0 < size < 224 
words in the data block. If the next word following this block is 
not 0, another block is loaded. 
m+1 Destination address to load the last block A valid °C31/VC33 24-bit address 


m+2_ First word of last block A ’C31/VC33 valid instruction or any 
32-bit wide data value (LSB first) 


j Last word of last source block 


j+1 Zero word. If more than one source block was read, word j Oh 
would be the last word of the last block. Each block consists 
of header and data portions. The block’s header is shaded 
darker than the block’s data section. 


T Words 1 and 2 do not exist in serial-port boot load since the source program does not reside in memory. 


Each source block of data can be loaded to different memory locations. Each 
block specifies its own size and destination address. The last source block of 
the data stream is appended with a zero word. 
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11.1.4.1_ Examples of External TMS320C31/VC33 Memory Loads 


Table 11-3, Table 11-4, and Table 11-5 show memory images for byte-wide, 
16-bit-wide, and 32-bit-wide configured memory (see Figure 4—2 on page 4-6). 


These examples assume the following: 


_} An INTO signal was detected after reset was deasserted (signifying an 
external memory load from boot 1). 


(j The source program header resides at memory location 0x1000 and 
defines the following: 


Hm Boot memory-type EPROMs that require two wait states and SWW = 11 


m A loader destination address at the beginning of the ’C31/VC33 in- 
ternal RAM block 14 


m Asingle block of memory that is Ox1FF in length 


Table 11-3. Byte-Wide Configured Memory 


Address Value Comments 

0x1000 0x08 Memory width = 8 bits 

0x1001 0x00 

0x1002 0x00 

0x1003 0x00 

0x1004 0x58 Memory type = SWW = 11, WCNT = 2 
0x1005 0x10 

0x1006 0x00 

0x1007 0x00 

0x1008 OxFF Program block size in words = Ox1FF 
0x1009 0x01 

0x100A 0x00 

0x100B 0x00 

0x100C 0x00 Program load starting address = 0x809C00 
0x100D Ox9C 

0x100E 0x80 

0x100F 0x00 
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Table 11-4. 


16-Bit-Wide Configured Memory 


Address 
0x1000 


0x1001 
0x1002 
0x1003 
0x1004 
0x1005 
0x1006 
0x1007 


Value 
0x10 


0x0000 
0x1058 
0x0000 
Ox1FF 

0x0000 
O0x9C00 
0x0080 


Table 11-5. 32-Bit-Wide Configured Memory 
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Address 
0x1000 


0x1001 
0x1002 
0x1003 


Value 
0x00000020 


0x00001058 
0x000001FF 
0x00809C00 


Comments 


Memory width = 16 


Memory type = SWW = 11, WCNT = 2 


Program block size in words = 0x1FF 


Program load starting address = 0x809C00 


Comments 
Memory width = 32 
Memory type = SWW = 11, WCNT = 2 


Program block size in words = 0x1FF 


Program load starting address = 0x809C00 


After reading the header, the loader transfers Ox IFF 32-bit words, beginning at 
a specified destination address 0x 809C00. Code blocks require the same 
byte and half-word ordering conventions. The loader can also load multiple 
code blocks at different address destinations. 


After loading all code blocks, the boot loader branches to the destination address 
of the first block loaded and begins program execution. Consequently, the first 
code lock loaded is a start-up routine to access the other loaded programs. 


It is assumed that at least one block of code is loaded when the 
loader is invoked. Initial loader invocation with a block size of 
0x00000000 produces unpredictable results. 
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11.1.4.2 Serial-Port Loading 


Boot loads, by way of the ’C31/VC33 serial port, are selected by driving the 
INT3 pin active (low) following reset. The loader automatically configures the 
serial port for 32-bit fixed-burst-mode reads. It is interrupt-driven by the frame 
synchronization receive (FSR) signal. You cannot change this mode for boot 
loads. Your hardware must generate the serial-port clock and FSR externally. 


As with memory loads, a header must precede the actual program to be loaded. 
However, you need only supply the block size and destination address because 
the loader and your hardware have predefined serial-port speed and data format 
(that is, skip data words 0 and 1). 


The transferred data-bit order must begin with the MSB and end with the LSB. 


11.1.5 Interrupt and Trap-Vector Mapping 


Unlike the microprocessor mode, the microcomputer/boot-loader (MCBL) 
mode uses a dual-vectoring scheme to service interrupt and trap requests. Dual 
vectoring was implemented to ensure code compatibility with future versions of 
’C8x devices. 


In a dual-vectoring scheme, branch instructions to an address, rather than 
direct-interrupt vectoring, are used. The normal interrupt and trap vectors are 
defined to vector to the last 63 locations in the on-chip RAM, starting at address 
809FC1h. When the loader is invoked, the interrupt vector table is remapped 
by the processor to the last 63 locations in RAM block 1 of the ’C31/VC33. 
These locations are assumed to contain branch instructions to the interrupt 
source routines. 


Make sure that these locations are not inadvertently overwritten by 
loaded program or daia values. 


Table 11-6 shows the MCBL/MP mode interrupt and trap instruction memory 
maps. 
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Table 11-6. TMS320C31/VC33 Interrupt and Trap Memory Maps 


Address Description 
809FC1 INTO 

809FC2 INT4 

809FC3 INT2 

809FC4 INT3 

809FC5 XINTO 

809FC6 RINTO 

809FC7 XINT1 (Reserved) 
809FC8 RINT1 (Reserved) 
809FC9 TINTO 

809FCA TINT1 

809FCB DINTO 


809FCC-809FDF Reserved 


809FE0O TRAPO 
809FE1 TRAP1 
e e 

e e 

e e 
809FFB TRAP27 


809FFC-809FFF Reserved 
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11.1.6 TMS320C31/VC33 Boot-Loader Precautions 


The boot loader builds a one-word-deep stack, starting at location 809801h. 


Avoid loading code at location 809801h. 


The interrupt flags are not reset by the boot-loader function. If pending interrupts 
are to be avoided when interrupts are enabled, clear the IF register before 
enabling interrupts. 


The MCBL/MP pin must remain high during the entire boot-loader execution, 
but it can be changed subsequently at any time. The 'C31/VC33 does not need 
to be reset after the MCBL/MP pin is changed. During the change, the 
’C31/VC33 must not access addresses Oh-FFFh. The memory space 
Oh-FFFh will be mapped to external memory three clock cycles after changing 
the MCBL/MP pin. 
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11.2 TMS320C32 Boot Loader Zon: 


This section describes how to use the ’C32 microcomputer/boot loader 
(MCBL/MP) functions. 


11.2.1 TMS320C32 Boot-Loader Description 


The ’C32 boot loader is an enhanced version of that found in the ’C31/VC33. 
The boot loader can load and execute programs received from a host proces- 
sor through standard memory devices (including EPROM), with and without 
handshake, or through the serial port. The ‘C32 boot loader supports 16- and 
32-bit program external memory widths, as well as 8-, 16-, and 32-bit data- 
type sizes and external memory widths. 


The programs to be loaded reside in one of three memory-mapped areas identi- 
fied as Boot 1, Boot 2, and Boot 3 (see shaded areas of Figure 4—3 on page 4-6) 
or they are received by means of the serial port. 


The boot-loader code starts at location 0x45 in the on-chip ROM. The source 
code is supplied in Appendix C. 


11.2.2 TMS320C32 Boot-Loader Mode Selection 


The ’C32 boot loader functions as a memory boot loader, memory boot loader 
with handshake, or a serial-port boot loader. The boot-loader mode selection 
is determined by the status of the INT3-INTO pins immediately following reset. 
Table 11-7 lists the boot-loader modes. 


(1 The memory boot loader supports user-definable byte, half-word, and full- 
word data formats, allowing the flexibility to load a source program from 
memories having widths of 8, 16, and 32 bits with or without handshaking. 
The source programs to be loaded reside in one of the three memory loca- 
tions: 1000h, 810000h, and 900000h (see Table 11-7). 


[J The memory boot-load handshaking mode uses XFO as a data-acknowledge 
signal and XF1 as a data-ready signal. 


Li The serial-port boot loader supports 32-bit fixed-burst loads from the ’C32 
serial port with an externally generated serial-port clock and frame sync 
signals. The format is similar to that of the memory boot loader, except that 
the source memory width is omitted. 
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Table 11-7. Boot-Loader Mode Selection 


INTO INT1 
; 
1 0 
1 1 
1 1 
0 1 
1 0 


INT2 
1 
1 
0 
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INT3 Boot Loader Mode Source Program Location 
1 External memory Boot 1 address 1000h 
1 External memory Boot 2 address 810000h 
1 External memory Boot 3 address 900000h 
0 32-bit fixed-burst serial Serial Port 
0 External memory with handshake Boot 1 address 1000h, 


XFO and XF1 used in handshaking 


Boot 2 address 81 0000h, 
XFO and XF1 used in handshaking 


Boot 3 address 900000h, 
XFO and XF1 used in handshaking 


0 External memory with handshake 


0 External memory with handshake 


11.2.3 TMS320C32 Boot-Loading Sequence 


The following is the sequence of events that occur during the boot load of a 
source program. Table 11-2 shows the structure of the source program. 


1) 


Select the boot loader by resetting the ‘C32 while driving the MCBL/MP pin 
high and the corresponding INT3-INTO pins low. The MCBL/MP must stay 
high during boot loading, but can be changed anytime after boot loading has 

terminated. No reset is necessary when changing the INT3-INTO pin, as 
long as the ’C32 is not accessing the overlapping memory (Oh—FFFh) during 
this transition. In nonhandshake mode, one of the INT3-INTO pins can be 
driven low any time after deasserting the RESET pin (driven low and then 

high). While in handshake mode, two interrupt pins must be asserted before 

deasserting the RESET pin. 


The status of the IF register’s INT3—-INTO bit fields dictates the boot-loading 
mode. The bits are polled in the order described in the flowchart in 
Figure 11-4. 


If only the IF register’s INT3 bit field is set, the boot loader configures the 
serial port for 32-bit fixed burst mode reads with an externally generated 
serial-port clock and FSR. Then, it proceeds to boot load the source pro- 
gram from the serial port. A header indicating the STRBO, STRB1, and 
IOSTRB control registers precedes the actual program (see Table 11-2). 
These header values are loaded into the corresponding locations at the 
completion of the boot-load operation. The transferred data-bit order sup- 
plied to the serial port must begin with the most significant bit (MSB) and 
end with the least significant bit (LSB). Figure 11-5 depicts the boot-loader 
serial-port flow. 
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4) 


Otherwise, the boot loader attempts a memory boot load. Figure 11-6 
shows the boot-loader memory flow. If the IF register’s INTO bit field is set, 
the source program is loaded from memory location 1000h. If the IF regis- 
ter’s INT1 bit field is set, the source program is loaded from memory location 
810000h. If the IF register’s INT2 bit field is set, the source program is 
loaded from memory location 900000h. After determining the memory loca- 
tion of the source program, the boot loader checks INTS bit field in the IF 
register. If this bit is set, all data transfers are performed with synchronous 
handshake. The handshake protocol uses XFO as a data-acknowledge and 
XF1 as a data-ready signals. ’C32’s XFO is an output pin while the XF1 is 
an input pin. Figure 11-7 shows the handshake data-transfer operation. 


The data-transfer operation occurs as follows: 
a) The’C32 boot loader waits until the host sets XF1 low to read in the 
data. While the ’C32 waits for XF1 to drop low, the IACK pin pulses 


until XF1 is low. Setting XF1 low communicates to the ’C32 that the 
data is valid. The [ACK pulse indicates that the ‘C32 is waiting for data. 


b) The boot loader sets XFO low after reading the data value. Dropping 
XFO acknowledges to the host that the data was read. 


c) The host sets XF1 high to inform the ‘C32 that the data is no longer valid. 
d) The ’C32 terminates the transfer by setting XFO high. 


The memory boot-load source program has a header indicating the boot 
memory width and the contents of the STRBO, STRB1, and IOSTRB control 
registers (see Table 11-2). 


After reading the header, the boot loader copies the source-program 
blocks. The source-program blocks have three entries preceding the 
source-program-block data. The first entry in the source-program block 
indicates the size of the block, the second entry indicates the address where 
the block is to be loaded, while the third entry contains the destination- 
memory strobe including a pointer that identifies the destination-memory 
strobe (STRBO, STRB1, or IOSTRB) and a value that describes the strobe 
configuration for the memory width and data-type size. If the destination 
memory is internal, the third entry should contain a zero. The boot loader 
cannot load the source program to any memory address below 1000h, 
unless the address decode logic is remapped. 


Once all the program blocks are loaded into their respective address loca- 
tions with the given data-type sizes, the boot loader sets the IOSTRB, 
STRBO, and STRB1 control registers to the values read at the beginning 
of the boot-load process. 


The boot loader branches to the destination address of the first source 
block loaded and begins program execution. 
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Figure 11-4. TMS320C32 Boot-Loader Mode-Selection Flowchart 
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Figure 11-5.Boot-Loader Serial-Port Load Flowchart 
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Figure 11-6.Boot-Loader Memory-Load Flowchart 
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Figure 11-7.Handshake Data-Transfer Operation 
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11.2.4 TMS320C32 Boot Data Stream Structure 


Table 11-8 shows the data stream structure. The data stream is composed of 
a header of three (serial-port load) or four (memory load) words and one or 
more blocks of source data. The boot loader uses this header to determine the 
physical memory width where the source program resides (memory load) and 
to configure the STRBs after completion of source program boot load. The 
blocks of source data have three entries in addition to the raw data. The first 
entry in this block indicates the size of the block. The second entry in this block 
indicates the memory address where the boot loader copies this source block. 
The third entry contains the destination memory strobe configuration including 
memory width and data-type size. This allows the boot loader to copy and store 
8-, 16-, or 32-bit data values into 8-, 16-, or 32-bit wide memory. Words 8 
through n of the shaded entries in Table 11-8 contain the source data for the 
first block. 
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Table 11-8. Source Data Stream Structure 


Wordt 
1 


m+1 


Content 


Memory width (8, 16, or 32 bits) where source program 
resides 


Value to set the IOSTRB control register at end of boot loader 
process 


Value to set the STRBO control register at end of boot loader 
process 


Value to set the STRB1 control register at end of boot loader 
process 


Size of the first data block. The block size is the number of 
words in the data block (word length is specified by the data- 
type size). A 0 in this entry signifies the end of the source 
data stream. 

Destination address to load the first block 


First block destination memory width and data-type size in 
the format given in the Valid Data Entries column. 


First word of first block 


Last word of first block 


Size of the last data block. The block size is the number of 
words in the data block (word length is specified by the 
data-type size). If the next word following this block is not 0, 
another block is loaded. 


Destination address to load the last block 


TMS320C32 Boot Loader 


Valid Data Entries 


8h, 10h, or 20h, respectively 


See Section 10.7 on page 10-26 


See Section 10.3.1 on page 10-7 


See Section 10.6 on page 10-20 


0 <size < 224 


A valid ’C32 24-bit address 


SSSSSS6xht 


A’C82 valid instruction or any 8-, 
16-, or 32-bit wide data value 


A’C82 valid instruction or any 8-, 
16-, or 32-bit wide data value 


0 <size < 224 


A valid ’C32 
24-bit address 


T Word 1 does not exist in serial-port boot load since the source program does not reside in memory. 

+The SSSSSS hexadecimal digits refer to the lower 24 bits of the strobe control register. The x hexadecimal digit identifies 
the strobe as follows: 0 for IOSTRB, 4 for STRBO, and 8 for STRB1. SSSSSS6xh is cleared to 0 when loading the entire field 
into internal memory. 
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Table 11-8. Source Data Stream Structure (Continued) 


Wordt 


m+2 


m+3 


j+1 


Valid Data Entries 


Last block destination memory width and data-type size in SSSSSS6xht 
the format given in the Valid Data Entries column. 


First word of last block. A ’C82 valid instruction or any 8-, 


16-, or 32-bit wide data value 


Last word of last source block 


Zero word. Note that if more than one source block was read, Oh 
word / shown above would be the last word of the last block. 

Each block consists of header and data portions. The block’s 

header is shaded darker than the block’s data section. 


t Word 1 does not exist in serial-port boot load since the source program does not reside in memory. 
+The SSSSSS hexadecimal digits refer to the lower 24 bits of the strobe control register. The x hexadecimal digit identifies 
the strobe as follows: 0 for IOSTRB, 4 for STRBO, and 8 for STRB1. SSSSSS6xh is cleared to 0 when loading the entire field 


into internal memory. 
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Each source block can be loaded into a different memory location. Each block 
specifies its own size and destination address. The last source block of the 
data stream is appended with a zero word. Because the ’C32’s STRBs can be 
configured to support different external memory widths and data-type sizes, 
each source block specifies its data-type size. The external memory width is 
set when the boot loader reads the STRBx control register values in the source 
data stream header. 


To build a C32 boot data stream with the HEX30 utility provided with the 
TMS320 floating-point code-generation tools, use the following steps: 


Lj Compile/assemble code with -v32 switch using v4.7 or later of the 
TMS320 floating-point C compiler/assembler. If the code-generation tools 
are invoked with CL30 and -z switch, include —v32 switch in the linker 
command file. 


(1 Link as usual. 


(J) Run Hex30 utility version 4.7 or later. The -v32 switch used in the compiler/ 
assembler will create a header in the COFF file, identifying it as a ’C32 for 
the Hex30. 


TMS320C32 Boot Loader 


11.2.5 Boot-Loader Hardware Interface 


The hardware interface for the memory boot load uses the STRBX_B3 through 
STRBX_BO pins as strobe byte-enable pins (see Figure 11-8). The hardware 
interface is independent of the boot source memory width. This interface 
is identical to the 32-bit-wide memory interface described in Case 2, in Section 
10.6 on page 10-20. For 16-bit memory widths, remove the two left-most 
memory devices of Figure 11-8. For 8-bit memory widths, remove all but the 
right-most of the memory devices of Figure 11-8. 


Figure 11-8. External Memory Interface for Source Data Stream Memory Boot Load 
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11.2.6 TMS320C32 Boot-Loader Precautions 


The interrupt flags are not reset by the boot-loader function. If pending interrupts 
are to be avoided when interrupts are enabled, clear the IF register before enabling 
interrupts. 


The MCBL/MP pin should remain high during the entire boot-loading execution, 
but it can be changed subsequently at any time. The ‘C32 does not need to be 
reset after the MCBL/MP pin is changed. During the change, the ‘C32 should 
not access addresses Oh-FFh. The memory space Oh-FFFh is mapped to 
external memory three clock cycles after changing the MCBL/MP pin. 
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TMS320C32 Boot Loader 
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The ’C32 boot loader uses the following peripheral memory-mapped registers 
as a temporary stack: 


TimerO counter register (808024h) 

TimerO period register (808028h) 

DMAO source address register (808004h) 
DMAO destination address register (808006h) 
DMAO transfer counter register (808008h) 


ns a 


These memory-mapped registers are not reset by the boot-loading process. 
Before using these peripherals, reprogram these registers with the appropriate 
values. 


(04 a T=] 0) (=) am 4 


Peripherals 


The ’C3x features two timers, a serial port (two serial ports for the C30), and 
an on-chip direct memory access (DMA) controller (2-channel DMA controller 
on the ’C32). These peripheral modules are controlled through memory- 
mapped registers located on the dedicated peripheral bus. 


The DMA controller performs input/output operations without interfering with 
the operation of the CPU, making it possible to interface the ’C3x to slow, exter- 
nal memories and peripherals, analog-to-digital converters (A/Ds), serial 
ports, and so forth, without reducing the computational throughput of the CPU. 
The result is improved system performance and decreased system cost. 


Topic Page 
A DEAS TIMMONS rescece recs cce aces arareeateta cura, cesase eve aearepereneqaeh ce avave mia atarans aimee raneunaviesia ae 12-2 
1222: = Serial POMmtS ay esayezeatera ey cce ocarasevere svavers evaveus ave vaseeavarerarerere cera ni vcsetavers eral 12-15 
1253 (DMACGontrolle ty eiciere coca eee. crate reyeteranevavobeva: ete aravareyecerevs connpa avalevevalovenerare 12-48 


Timers 


12.1 Timers 


The 'C3x has two 32-bit general-purpose timer modules. Each timer has two 
signaling modes and internal or external clocking. You can use the timer 
modules to signal to the ’C3x or the external world at specified intervals or to 
count external events. With an internal clock, the timer can signal an external 
A/D converter to start a conversion, or it can interrupt the *C3x DMA controller 
to begin a data transfer. The timer interrupt is one of the internal interrupts. 
With an external clock, the timer can count external events and interrupt the 
CPU after a specified number of events. Each timer has an I/O pin that you can 
use as an input clock to the timer, as an output clock signal, or as a general- 
purpose |/O pin. 


Each timer consists of a 32-bit counter, a comparator, an input clock selector, 
a pulse generator, and supporting hardware (see Figure 12-1). A timer counts 
the cycles of a timer input clock with the counter register. When that counter 
register equals the value stored in the timer-period register, it resets the 
counter to 0 and produces a transition in the timer output signal. The timer input 
clock can be driven by either half the internal clock frequency of the ’C3x or 
an external clock on TCLKx pin. 


Figure 12-1. Timer Block Diagram 
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Counter (32-bit) 


C External clock 
INV 


Counter register 
(31-0) 


Period register (31-0) 
32 


Comparator 
? 


Period = counter 


Pulse generator 
> TSTAT 


Timer out 


INV 
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12.1.1 Timer Pins 


Each timer has one pin associated with the timer clock signal (TCLK) pin. This 
pin (TCK) is used as a general-purpose I/O signal, as a timer output, or as an 
input for an external clock for a timer. Each timer has a TCLK pin: TCLKO is 
connected to timer0, TCLK1 to timer1. 


12.1.2 Timer Control Registers 
Three memory-mapped registers are used by each timer: 


_) Global-control register 


The global-control register determines the operating mode of the timer, 
monitors the timer status, and controls the function of the I/O pin of the timer. 


_) Period register 


The period register specifies the timer’s signaling frequency. 


_) Counter register 


The counter register contains the current value of the incrementing counter. 
You can increment the timer on the rising edge or the falling edge of the 
input clock. The counter is zeroed and can cause an internal interrupt 
whenever its value equals that in the period register. The pulse generator 
generates either of two types of external clock signals: pulse or clock. The 
memory map for the timer modules is shown in Figure 12-2. 
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Figure 12-2. Memory-Mapped Timer Locations 


808020h Timer0 global controlt 


808024h TimerO countert 


808028h Timer0 period 


808030h Timer1 global controlt 


808034h Timer1 counter 


808038h Timer1 periodt 


tSee Section 12.1.3 
+See Section 12.1.4 


12.1.3 Timer Global-Control Register 


The timer global-control register is a 32-bit register that contains the global and 
port control bits for the timer module. Figure 12-3 shows the format of the 
timer global-control register. Bits 3-0 are the port control bits; bits 
11-6 are the timer global-control bits. At reset, all bits are set to 0 except for 
DATIN (which is set to the value read on TCLK). Table 12-1 describes the 
timer global-control register bits, their names, and functions. 


Figure 12-3. Timer Global-Control Register 


31. 16'15 12° 11 10 9 Go Ue Ff 6 ie) 4 ' 3 2 1 0 


xx ' xx! TSTAT | INV | CLKSRC C/P ' HLD xx! DATIN | DATOUT FUNC 
L) L) 
L L 


R R/W R/W Rw; RW RW R R/W RW R/W 


Notes: 1) R = read, W=write 
2) xx = reserved bit, read as 0 
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Table 12-1. 


Abbreviation 


FUNC 


vo 


DATOUT 


DATIN 
GO 


HLD 
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Timer Global-Control Register Bits Summary 


Reset 
Value 


0 


xt 


Name 


Function 


Input/output 


Data output 


Data input 
Go 


Counter hold 
signal 


Tx=0or1 (set to value read on TCLK) 


Description 


Controls the function of TCLK. 


If FUNC = 0, TCLK is configured as a general-purpose digital I/O 
port. 


If FUNC = 1, TCLK is configured as a timer pin. 


See section 12.1.6 Timer Operation Modes on page 12-10 for a 
description of the relationship between FUNC and CLKSRC. 


If FUNC = 0 and CLKSRC = 0, TCLK is configured as a general- 
purpose I/O pin. 


If /O = 0, TCLK is configured as a general-purpose input pin. 


If VO = 1, TCLK is configured as a general-purpose output pin. 


Drives TCLK when the ’C3x is in I/O port mode. You can use DAT- 
OUT as an input to the timer. 


Data input on TCLK or DATOUT. A write has no effect. 


Resets and starts the timer counter. 


When GO = 1 and the timer is not held, the counter is zeroed and 
begins incrementing on the next rising edge of the timer input clock. 
The GO bit is cleared on the same rising edge. 


GO = 0 has no effect on the timer. 


When this bit is 0, the counter is disabled and held in its current 
state. If the timer is driving TCLK, the state of TCLK is also held. 


The internal divide-by-2 counter is also held so that the counter can 
continue where it left off when HLD is set to 1. 


You can read and modify the timer registers while the timer is being 
held. RESET has priority over HLD. The effect of writing to GO and 
HOLD is shown below. 


GO HLD Result 


0 0 All timer operations are held. No reset is performed 
(reset value). 

0 1 Timer proceeds from state before write. 

1 0 _Alltimer operations are held, including zeroing of the 


counter. The GO bit is not cleared until the timer is 
taken out of hold. 


Timer resets and starts. 
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Table 12-1. 


Abbreviation 
c/P 


CLKSRC 


INV 


TSTAT 


Timer Global-Control Register Bits Summary (Continued) 


Reset 
Value 


0 


0 


0 


Name 


Clock/pulse 
mode control 


Clock source 


Inverter control 
bit 


Timer status bit 


tx=0or1 (set to value read on TCLK) 
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Description 


When C/P = 1, clock mode is chosen, and the signal- 
ing of the TSTAT flag and external output has a 50% 
duty cycle. 


When G/P = 0, the status flag and external output 
will be active for one H1 cycle during each timer 
period (see Figure 12-4 on page 12-8). 


This bit specifies the source of the timer clock. 


When CLKSRC = 1, an internal clock with a frequen- 
cy equal to one-half of the H1 frequency is used to 
increment the counter. The INV bit has no effect on 
the internal clock source. 


When CLKSRC = 0, you can use an external signal 
from the TCLK pin to increment the counter. The ex- 
ternal clock is synchronized internally, thus allowing 
external asynchronous clock sources that do not ex- 
ceed the specified maximum allowable external 
clock frequency. This is less than f(H1)/2. 


See section 12.1.6, Timer Operation Modes, on 
page 12-10 for a description of the relationship be- 
tween FUNC and CLKSRC. 


If an external clock source is used and INV = 1, the 
external clock is inverted as it goes into the counter. 


If the output of the pulse generator is routed to TCLK 
and INV = 1, the output is inverted before it goes to 
TCLK (see Figure 12-1 on page 12-2). 


If INV = 0, no inversion is performed on the input or 
output of the timer. The INV bit has no effect, regard- 
less of its value, when TCLK is used in I/O port 
mode. 


This bit indicates the status of the timer. It tracks the 
output of the uninverted TCLK pin. This flag sets a 
CPU interrupt on a transition from 0 to 1. A write has 
no effect. 
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12.1.4 Timer-Period and Counter Registers 


The 32-bit timer-period register is used to specify the frequency of the timer 
signaling. The timer-counter register is a 32-bit register, which is reset to 0 
whenever it increments to the value of the period register. Both registers are 
set to 0 at reset. 


Certain boundary conditions affect timer operation. These conditions are listed 
below: 


() When the period and counter registers are 0, the operation of the timer is 
dependent upon the C/P mode selected. In pulse mode (C/P = 0), TSTAT 
is set and remains set. In clock mode (C/P = 1), the width of the cycle is 
2/f(H1), and the external clocks are ignored. 


1 When the counter register is not 0 and the period register = 0, the counter 
counts, rolls over to 0, and behaves as described above. 


_j When the counter register is set to a value greater than the period register, 
the counter may overflow when incremented. Once the counter reaches its 
maximum 32-bit value (OFFFFFFFFh), it rolls over to 0 and continues. 


Writes from the peripheral bus override register updates from the counter and 
new status updates to the control register. 


12.1.5 Timer Pulse Generation 


The timer pulse generator (see Figure 12-1 on page 12-2) can generate several 
external signals. You can invert these signals with the INV bit. The two basic 
modes are pulse mode and clock mode, as shown in Figure 12-4. In both modes, 
an internal clock source f (timer clock) has a frequency of f(H1)/2, and an external- 
ly generated clock source f (timer clock) can have a maximum frequency of 
f(H1)/2.6. In pulse mode (C/P = 0), the width of the pulse is 1/f(H1). 
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Figure 12-4. Timer Timing 
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(a) TSTAT and timer output (INV = 0) when C/P =0 (pulse mode) 


«ol 4/(CLKSRC) | 
\e > period register/f(CLKSRC) 


t t t 


TINT TINT TINT 


(b) TSTAT and timer output (INV = 0) when C/P = 1 (clock mode) 
k¢—_—Y—_—+- 1/f(CLKSRC) 


4» —|- I aii) 
lq »— period register/((CLKSRC) | 


| 
}¢—_———————- 2 x period register/f(CLKSRC) ————————_ 


t t 


TINT TINT 


The timer signaling is determined by the frequency of the timer input clock and 
the period register. The following equations are valid with either an internal or 
an external timer clock: 


f(pulse mode) = f(timer clock) / period register 
f(clock mode) = f(timer clock) / (2 x period register) 


 ———————— — ——— — — — — —— — — — —  ——Ma uu a —=~———C.CT™.Cc— 


Note: Period register 
If the period register equals 0, see Section 12.1.4. 


Cd 


Example 12-1 provides some examples of the TCLKx output when the period 
register is set to various values and clock or pulse mode is selected. 
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Example 12-1. Timer Output Generation Examples 


(a) INV =0, C/P =0 (pulse mode) 
timer period = 1 
Also, _ 
INV = 0, C/P = 1 (clock mode) 
timer period = 0 


2H1 


HI 


(b) INV =0, C/P =0 (pulse mode) 
timer period = 2 


4H1 


HI 
JUSJUILILI 


(c) INV =0, C/P =0 (pulse mode) 
timer period = 3 


j— 6H1 > 
HI 


JUSJU_J LJ LT 


(d) INV =0, C/P =1 (clock mode) 
timer period = 1 
}¢— 4H1 >) 
> 2H1}— | 


(e) INV =0, C/P =1 (clock mode) 
timer period = 2 


(f) INV =0, C/P =1 (clock mode) 
timer period = 3 
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12.1.6 Timer Operation Modes 


The timer can receive its input and send its output in several different modes, 
depending upon the setting of CLKSRC, FUNC, and I/O. The four timer modes 
of operation are defined in the following sections. 


12.1.6.1| CLKSRC = 1 and FUNC = 0 


If CLKSRC = 1 and FUNC = 0, the timer input comes from the internal clock. 
The internal clock is not affected by the INV bit in the global-control register. 
In this mode, TCLK is connected to the I/O port control, and you use TCLK as 
a general-purpose I/O pin (see Figure 12-5). 


If I/O =0, TCLK is configured as a general-purpose input pin whose state 
you can read in DATIN. DATOUT has no effect on TCLK or DATIN. See 
Figure 12-5 (a). 


If I/O = 1, TCLK is configured as a general-purpose output pin. DATOUT 
is placed on TCLK and can be read in DATIN. See Figure 12-5 (b). 


Figure 12-5. Timer Configuration with CLKSRC = 1 and FUNC = 0 


(a) CLKSRC=1 (internal) (b) | CLKSRC = 1 (internal) 
FUNG=0 (I/O pin) FUNG = 0 (I/O pin) 
/O=0 (input) I/O = 1 (output) 


Internal External 


Internal | External Timer 


Timer Internal 
Internal | clock 
Output |» 
E DATOUT > TCLK 


I/O port 
control 


DATIN 


| TSTAT 


TSTAT 


I/O port 
control 


DATIN 
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12.1.6.2 CLKSRC = 1 and FUNC = 1 


If CLKSRC = 1 and FUNC = 1 (see Figure 12-6), the timer input comes from 
the internal clock, and the timer output goes to TCLK. This value can be inverted 
using INV, and you can read in DATIN the value output on TCLK. 


Figure 12-6. Timer Configuration with CLKSRC = 1 and FUNC = 1 


Internal | External 


Internal 


TSTAT DATIN 


CLKSRC = 1 (internal) 
FUNC = 1 (timer pin) 


12.1.6.3 CLKSRC = 0 and FUNC =0 


If CLKSRC = 0 and FUNC = 0 (see Figure 12-7), the timer is driven according 
to the status of the I/O bit. 


1 If I/O =0, the timer input comes from TCLK. This value can be inverted 
using INV, and you can read in DATIN the value of TCLK. See 
Figure 12-7 (a). 


 If/O = 1, TCLK is an output pin. Then, TCLK and the timer are both driven 
by DATOUT. All 0-to-1 transitions of DATOUT increment the counter. INV 
has no effect on DATOUT. You can read in DATIN the value of DATOUT. 
See Figure 12-7 (b). 


Figure 12-7. Timer Configuration with CLKSRC = 0 and FUNC = 0 


CLKSRC = 0 (external) CLKSRC = 0 (external) 
FUNC = 0 (W/O pin) FUNC = 0 (W/O pin) 
/O = 0 (input) /O = 1 (output) 
(a) (b) 
Timer Internal | External Timer Internal | External 
TCLK TCLK 
DATIN DATOUT 
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12.1.6.4 CLKSRC =0 and FUNC = 1 
If CLKSRC = 0 and FUNC = 1 (see Figure 12-8), TCLK drives the timer. 


Lj If INV =0, all 0-to-1 transitions of TCLK increment the counter. 


Li If INV = 1, all 1-to-0 transitions of TCLK increment the counter. You can 
read in DATIN the value of TCLK. 


Figure 12-8. Timer Configuration with CLKSRC = 0 and FUNC = 1 


Timer Internal | External 
In | TCLK 
Out | 
TSTAT DATIN 


CLKSRC = 0 (external) 
FUNC = 1 (timer pin) 


12.1.7 Using TCLKx as General-Purpose I/O Pins 


When FUNC = 0, TCLKx can be used as an I/O pin. Figure 12-9 and 
Figure 12-10 show how the TCLKx is connected when it is configured as a 
general-purpose I/O pin. In Figure 12-9, the I/O bit equals 0 and TCLK is 
configured as an input pin whose value can be read in the DATIN bit. In 
Figure 12-10, the I/O bit equals 1 and TCLK is configured as an output pin that 
outputs the value you wrote in the DATOUT bit. 


Figure 12-9. TCLK as an Input (I/O = 0) 


Internal | External 
DATOUT (NC) ro) + TCLK 
| | 
DATIN 
VO =0 
FUNC =0 
Figure 12-10. TCLK as an Output (I/O = 1) 
| 
Internal | External 
| 
DATOUT > |+— > TCLK 
| | 
DATIN 
VO =1 
FUNC =0 
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12.1.8 Timer Interrupts 


A timer interrupt is generated whenever the TSTAT bit of the timer control register 
changes from a 0 to a 1. The frequency of timer interrupts depends on whether 
the timer is set up in pulse mode or clock mode. 


LJ In pulse mode, the interrupt frequency is determined by the following 


equation: 

. _ _[timer clock) 
(interrupt) period register 
where: 


interrupt) = interrupt frequency 
{timer clock) = timer frequency 


[J In clock mode, the interrupt frequency is determined by the following 


equation: 
f Fi 
f, 7 (timer clock) 
(interrupt) 2 x period register 
where: 


interrupt) = interrupt frequency 
{timer clock) = timer frequency 


The timer counter is automatically reset to 0 whenever it is equal to the value 
in the timer-period register. You can use the timer interrupt for either the CPU 
or the DMA. Interrupt-enable control for each timer, for either the CPU or the 
DMA, is found in the CPU/DMA interrupt-enable register. Refer to Section 3.1.8, 
CPU/DMA Interrupt-Enable Register (IE), on page 3-9 for more information. 


When a timer interrupt occurs, a change in the state of the corresponding 
TCLK pin is observed if FUNC = 1 and CLKSRC = 1 in the timer global-control 
register. The exact change in the state depends on the state of the C/P bit. In 
pulse mode (C/P = 0), the width of the pulse change is 1/f (H1). In clock mode 
(C/P = 1), the width of the pulse change is the period register divided by the 
frequency of the timer input clock. 


12.1.9 Timer Initialization/Reconfiguration 


The timers are controlled through memory-mapped registers located on the 
dedicated peripheral bus. The general procedure for initializing and/or recon- 
figuring the timers follows: 


1) Halt the timer by clearing the GO/HLD bits of the timer global-control register. 
To do this, write a 0 to the timer global-control register. Note that the timers 
are halted on RESET. 
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2) Configure the timer through the timer global-control register (with GO = 
HLD = 0), the timer-counter register, and timer-period register, if necessary. 


3) Start the timer by setting the GO/HLD bits of the timer global-control register. 


Example 12-2 shows how to set up the ‘C3x timer to generate the maximum 
clock frequency through the TCLKx pin. 


Example 12-2. Maximum Frequency Timer Clock Setup 


* 


Timer0O 
TCTRL_ RST 
TCTRL_GD 
TCNT 

TPRD 


* Maximum Frequency Timer Clock Setup 


.data 

.word 808020h ; Timer global control address 
.word 301h 

.word 3C1lh 

.word 0 ; Timer counter value 

. word 0 ; Timer-period value 

- text 

LDP Timer0O 

LD1 @Timer0,ARO ; Load data page pointer 
LD1 0,RO 

ST1 RO, *ARO ; Halt timer 

LD1 @TCTRL_RST,RO ; Configure timer 

STL RO, *ARO 

LD1 @TCNT, RO 

ST1 RO, *+ARO (4) ; Load timer counter 

LD1 @TPRD, RO 

ST1 RO, **+ARO (8) ; Load timer period 

LD1 @TCTRL_GO,RO 

ST1 RO, *ARO ; Start timer 
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Serial Ports 


The ’C30 has two totally independent bidirectional serial ports. Both serial ports 
are identical, and there is a complementary set of control registers in each one. 
Only one serial port is available on the ’C31/VC33 and the ’C32. You can config- 
ure each serial port to transfer 8, 16, 24, or 32 bits of data per word simulta- 
neously in both directions. The clock for each serial port can originate either inter- 
nally, through the serial port timer and period registers, or externally, through a 
supplied clock. An internally generated clock is a divide down of the clockout fre- 
quency, f(H1). A continuous transfer mode is available, which allows the serial 
port to transmit and receive any number of words without new synchronization 
pulses. 


Eight memory-mapped registers are provided for each serial port: 


Global-control register 

Two control registers for the six serial I/O pins 
Three receive/transmit timer registers 
Data-transmit register 

Data-receive register 


OHOUOUCU 


The global-control register controls the global functions of the serial port and 
determines the serial-port operating mode. Two port control registers control 
the functions of the six serial port pins. The transmit buffer contains the next 
complete word to be transmitted. The receive buffer contains the last complete 
word received. Three additional registers are associated with the transmit/ 
receive sections of the serial-port timer. A serial-port block diagram is shown 
in Figure 12-11 on page 12-16, and the memory map of the serial ports is 
shown in Figure 12-12 on page 12-17. 
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Figure 12-11. Serial Port Block Diagram 


Receive Section "| i Transmit Section _ 


CLKR CLKX 
Receive =v | «CUT STAT Transmit 
timer (16) os timer (16) 


Receive Clock 


Bit counter Bit counter 
(8/16/24/32) (8/16/24/32) 


Load Load 
control control 
DX DX 
DXR 
(32) 
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Figure 12-12. Memory-Mapped Locations for the Serial Ports 


808040h 


Serial-port 0 global controlt 


808042h 


Serial port 0 FSX/DX/CLKX controlt 


808043h 


Serial port 0 FSR/DR/CLKR control§ 


808044h 


Serial port 0 R/X timer control 


808045h 


Serial port 0 R/X timer counter# 


808046h 


Serial port 0 R/X timer period! 


808048h 


Serial port 0 data transmit* 


80804Ch 


Serial port 0 data receive 


808050h 


Serial-port 1 global controlt 


808052h 


Serial port 1 FSX/DX/CLKX controlt 


808053h 


Serial port 1 FSR/DR/CLKR control$ 


808054h 


Serial port 1 R/X timer control 


808055h 


Serial port 1 R/X timer counter# 


808056h 


Serial port 1 R/X timer period! 


808058h 


Serial port 1 data transmit* 


80805Ch 


Serial port 1 data receive 


Note: Serial port1 locations are reserved on the ’'C31/VC33 and ’C32. 


Tt See Figure 12-13. 
+ See Figure 12-14. 
§ See Figure 12-15. 
See Figure 12-16. 
# See Figure 12-17. 
ll See Figure 12-18. 
* See Figure 12-19. 
[See Figure 12-20. 


12.2.1 Serial-Port Global-Control Register 


The serial-port global-control register is a 32-bit register that contains the global- 
control bits for the serial port. The register is shown in Figure 12-13. Table 12-2 
shows the register bits, bit names, and bit functions. 
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Figure 12-13. Serial-Port Global-Control Register 


30 29 28° 24 23 22 21 20 19 18 17 16 

EN Ee 
RW RW RW RW RW, RW RW RW RW 
e - 7 6 5 4 ¢ 3 2 1 0 

peieieeeas von [|= [AL ah 
RW 1 RW RW RW 1 RW RW RW ROR RW R R 


Notes: 1) R = read, W =write 
2) xx = reserved bit, read as 0 


Table 12-2. Serial-Port Global-Control Register Bits Summary 


Reset 

Abbreviation Value Name Description 

RRDY 0 Receive ready flag If RRDY = 1, the receive buffer has new data and is ready to 
be read. A three H1/H3 cycle delay occurs from the loading 
of DRR to RRDY = 1. The rising edge of this signal sets RINT. 
If RRDY = 0, the receive buffer does not have new data since 
the last read. RRDY = 0 at reset and after the receive buffer is 
read. 

XRDY 1 Transmit ready flag If XRDY = 1, the transmit buffer has written the last bit of data 
to the shifter and is ready for a new word. A three H1/H3 cycle 
delay occurs from the loading of the transmit shifter until 
XRDY is set to 1. The rising edge of this signal sets XINT. 

If XRDY = 0, the transmit buffer has not written the last bit of 
data to the transmit shifter and is not ready for a new word. 

FSXOUT Transmit frame sync = FSXOUT = 0 configures the FSX pin as an input. 

configuration FSXOUT = 1 configures the FSX pin as an output. 

XSREMPTY 0 Transmit-shift If XSREMPTY = 0, the transmit-shift register is empty. 

register emply flag XSREMPTY = 1, the transmit-shift register is not empty. 
Reset or XRESET causes this bit to = 0. 
RSRFULL 0 Receive-shift register If RSRFULL = 1, an overrun of the receiver has occurred. In 


full flag continuous mode, RSRFULL is set to 1 when both RSR and 
DRR are full. In noncontinuous mode, RSRFULL is set to 1 
when RSR and DRR are full and a new FSR is received. A read 
causes this bit to be set to 0. This bit can be set to 0 only by a 
system reset, a serial-port receive reset (RRESET = 1), ora 
read. When the receiver tries to set RSRFULL to 1 at the same 
time that the global register is read, the receiver dominates, 
and RSRFULL is set to 1. 


lf RSRFULL = 0, no overrun of the receiver has occurred. 
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Table 12-2. Serial-Port Global-Control Register Bits Summary (Continued) 


Reset 
Abbreviation Value 
HS 0 


XCLK SRCE 0 


RCLK SRCE 0 


XVAREN 0 
RVAREN 0 
XFSM 0 
RFSM 0 
CLKXP 


Name 


Handshake 


Transmit clock 
source 


Receive clock 
source 


Transmit data rate 
mode 


Receive data rate 
mode 


Transmit frame 
sync mode 


Receive frame 
sync mode 


CLKX polarity 


Description 
If HS = 1, the handshake mode is enabled. 
If HS = 0, the handshake mode is disabled. 


If XCLK SRCE = 1, the internal transmit clock is used. 
lf XCLK SRCE = 0, the external transmit clock is used. 


If RCLK SRCE = 1, the internal receive clock is used. 
lf RCLK SRCE = 0, the external receive clock is used. 


Specifies a fixed or variable data rate mode when transmitting. 


With a fixed data rate, FSX is active for at least one XCLK 
cycle and then goes inactive before transmission begins. 


With variable data rate, FSX is active while all bits are being 
transmitted. When you use an external FSX and variable data 
rate signaling, the DX pin is driven by the transmitter when 
FSX is held active or when a word is being shifted out. 


Specifies a fixed or variable data rate mode when receiving. 


If RVAREN = 0 (fixed data rate), FSX is active for at least one 
RCLK cycle and then goes inactive before reception begins. 


If RVAREN = 1 (controlled data rate), FSX is active while all 
bits are being received. 

Configures the port for continuous mode operation or standard 
mode operation. 


If XFSM = 1 (continuous mode), only the first word of a block 
generates a sync pulse, and the rest are transmitted continuously 
to the end of the block. 


If XFSM = 0 (standard mode), each word has an associated 
sync pulse. 

Configures the port for continuous mode operation or standard 
mode operation. 


If RFSM = 1 (continuous mode), only the first word of a block 
generates async pulse, and the rest are received continuously 
to the end of the block. 


If RFSM = 0 (standard mode), each word received has an 
associated sync pulse. 


If CLKXP = 0, CLKX is active high. 
If CLKXP = 1, CLKX is active low. 
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Table 12-2. 


Abbreviation 
CLKRP 


DXP 


DRP 


FSXP 


FSRP 


XLEN 


RLEN 


XTINT 


XINT 


RTINT 
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Serial-Port Global-Control Register Bits Summary (Continued) 


Reset 
Value 


0 


00 


00 


Name 
CLKR polarity 


DX polarity 


DR polarity 


FSX polarity 


FSR polarity 


Transmit word length 


Receive word length 


Transmit timer 
interrupt enable 


Transmit interrupt 
enable 


Receive timer 
interrupt enable 


Description 
If CLKRP = 0, CLKR is active (high). 
If CLKRP = 1, CLKR is active (low). 


If DXP = 0, DX is active (high). 
If DXP = 1, DX is active (low). 


If DRP = 0, DR is active (high). 
If DRP = 1, DR is active (low). 


If FSXP = 0, FSX is active (high). 
If FSXP = 1, FSX is active (low). 


If FSRP = 0, FSR is active (high). 
If FSRP = 1, FSR is active (low). 
These two bits define the word length of serial data trans- 


mitted. All data is assumed to be right justified in the transmit 
buffer when fewer than 32 bits are specified. 


00 — 8 bits 
01 — 16 bits 


10 — 24 bits 
11 — 32 bits 


These two bits define the word length of serial data received. 
All data is right justified in the receive buffer. 


00 — 8 bits 
01 — 16 bits 


10 — 24 bits 
11 — 32 bits 


If XTINT = 0, the transmit timer interrupt is disabled. 


If XTINT = 1, the transmit timer interrupt is enabled. 


If XINT = 0, the transmit interrupt is disabled. 
If XINT = 1, the transmit interrupt is enabled. 


Note: The CPU receive flag XINT and the serial-port-to-DMA 
interrupt (EXINTO in the IE register) is the OR of the enabled 
transmit timer interrupt and the enabled transmit interrupt. 


If RTINT = 0, the receive timer interrupt is disabled. 


If RTINT = 1, the receive timer interrupt is enabled. 
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Table 12-2. Serial-Port Global-Control Register Bits Summary (Continued) 


Abbreviation 
RINT 


XRESET 


RRESET 


Reset 
Value 


0 


0 


Name 


Receive interrupt 
enable 


Transmit reset 


Receive reset 


Description 
If RINT = 0, the receive interrupt is disabled. 
If RINT = 1, the receive interrupt is enabled. 


Note: The CPU receive interrupt flag RINT and the serial- 
port-to-DMA interrupt (ERINTO in the IE register) are the OR 
of the enabled receive timer interrupt and the enabled receive 
interrupt. 


If XRESET = 0, the transmit side of the serial port is reset. 


To take the transmit side of the serial port out of reset, set 
XRESET to 1. 


Do not set XRESET to 1 until at least three cycles after RESET 
goes inactive. This applies only to system reset. Setting XRESET 
to 0 does not change the contents of any of the serial-port control 
registers. It places the transmitter in a state corresponding to the 
beginning of a frame of data. Resetting the transmitter generates 
a transmit interrupt. Reset this bit during the time the mode of the 
transmitter is set. You can toggle XFSM without resetting the 
global-control register. 


If RRESET = 0, the receive side of the serial port is reset. 


To take the receive side of the serial port out of reset, set RRE- 
SET to 1. 


Do not set RRESET to 1 until at least three cycles after RESET 
goes inactive. This applies only to system reset. Setting RRESET 
to 0 does not change the contents of any of the serial-port control 
registers. It places the receiver in a state corresponding to the 
beginning of a frame of data. Reset this bit at the same time that 
the mode of the receiver is set. You can toggle without resetting 
the global-control register. 


Peripherals 12-21 


Serial Ports 


12.2.2 FSX/DX/CLKX Port-Control Register 


This 32-bit port-control register controls the function of the serial port FSX, DX, 
and CLKX pins. The register is shown in Figure 12-14. Table 12-3 shows the 
register bits, bit names, and bit functions. 


Figure 12-14. FSX/DX/CLKX Port-Control Register 


Notes: 1) R = read, W =write. 


2) xx = reserved bit, read as 0. 


Table 12-3. FSX/DX/CLKX Port-Control Register Bits Summary 


Reset 
Abbreviation Value Name Description 
CLKX FUNC 0 Clock transmit Controls the function of CLKX. 
eon If CLKX FUNC = 0, CLKX is configured as a general-purpose 
digital I/O port. 
If CLKX FUNC = 1, CLKX is configured as a serial port pin. 
CLKX 1/0 0 Clock transmit If CLKX VO = 0, CLKX is configured as a general-purpose input 
input/output pin. 
mode If CLKX VO = 1, CLKX is configured as a general-purpose output 
pin. 
CLKX DATOUT 0 Clock transmit Data output on CLKX when configured as general-purpose 
data ouput output. 
CLKX DATIN xt Clock transmit Data input on CLKX when configured as general-purpose input. 
data input A write has no effect. 
DX FUNC 0 DX function DXFUNC controls the function of DX. 
If DXFUNC = 0, DX is configured as a general-purpose digital 
I/O port. 
If DXFUNC = 1, DX is configured as a serial port pin. 
Dx I/O 0 DX input/output If DX I/O = 0, DX is configured as a general-purpose input pin. 
mpegs If DX VO = 1, DX is configured as a general-purpose output pin. 
DX DATOUT 0 DX data output Data output on DX when configured as general-purpose output. 
tx =0or1 


12-22 


Serial Ports 


Table 12-3. FSX/DX/CLKX Port-Control Register Bits Summary (Continued) 


Reset 

Abbreviation Value Name Description 

DX DATIN xt DX data input Data input on DX when configured as general-purpose input. 
A write has no effect. 

FSX FUNC 0 FSX function Controls the function of FSX. 
If FSX FUNC = 0, FSX is configured as a general-purpose digital 
I/O port. 
If FSX FUNC = 1, FSX is configured as a serial port pin. 

FSX 1/0 0 FSX input/output If FSX 1/O = 0, FSX is configured as a general-purpose input 

mode pin. 

If FSX VO = 1, FSX is configured as a general-purpose output 
pin. 

FSX DATOUT 0 FSX data output Data output on FSX when configured as general-purpose output. 

FSX DATIN xt FSX data input Data input on FSX when configured as general-purpose input. 


A write has no effect. 


Tx =0or1 


12.2.3 FSR/DR/CLKR Port-Control Register 


This 32-bit port-control register is controlled by the function of the FSR, DR, 
and CLKR pins. At reset, all bits are set to 0. The register is shown in 
Figure 12-15. Table 12-4 shows the register bits, bit names, and bit functions. 


Figure 12-15. FSR/DR/CLKR Port-Control Register 


' FSR FSR FSR FSR : DR DR DR ; CLKR CLKR CLKR | CLKR 
4 DATIN | DATOUT vO FUNC 49 DATIN |} DATOUT VO aKis « DATIN DATOUT VO FUNC 


Notes: 1) R = read, W=write 
2) xx = reserved bit, read as 0 
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Table 12-4. FSR/DR/CLKR Port-Control Register Bits Summary 


Abbreviation 


CLKR FUNC 


CLKR I/O 


CLKR DATOUT 


CLKR DATIN 


DR FUNC 


DRI/O 


DR DATOUT 


DR DATIN 


FSR FUNC 


FSR I/O 


FSR DATOUT 


Tx =Oor1. 
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Reset 
Value 


0 


xt 


xt 


Name 


Clock receive 
function 


Clock receive 
input/output 
mode 


Clock receive 
data output 


Clock receive 
data input 


DR function 


DR input/output 
mode 


DR data output 


DR data input 


FSR function 


FSR input/output 
mode 


FSR data output 


Description 


Controls the function of CLKR. 


If CLKR FUNC = 0, CLKR is configured as a general-purpose 
digital I/O port. 


If CLKR FUNC = 1, CLKR is configured as a serial port pin. 


If CLKR I/O = 0, CLKR is configured as a general-purpose input 
pin. 


If CLKR VO = 1, CLKRis configured as a general-purpose output 
pin. 


Data output on CLKR when configured as general-purpose 
output. 


Data input on CLKR when configured as general-purpose input. 
A write has no effect. 
Controls the function of DR. 


If DR FUNC = 0, DR is configured as a general-purpose digital 
I/O port. 


If DR FUNC = 1, DR is configured as a serial port pin. 


If DR I/O = 0, DRis configured as a general-purpose input pin. 


If DRVO = 1, DRis configured as a general-purpose output pin. 
Data output on DR when configured as general-purpose output. 


Data input on DR when configured as general-purpose input. 
A write has no effect. 
FSR FUNC controls the function of FSR. 


If FSR FUNC = 0, FSR is configured as a general-purpose digital 
I/O port. 


If FSR FUNC = 1, FSR is configured as a serial port pin. 


If FSR I/O = 0, FSRis configured as a general-purpose input 
pin. 


If FSR V/O = 1, FSRis configured as a general-purpose output 
pin. 


Data output on FSR when configured as general-purpose output. 
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Table 12-4. FSR/DR/CLKR Port-Control Register Bits Summary (Continued) 


Reset 
Abbreviation Value Name Description 
FSR DATIN xt FSR data input Data input on FSR when configured as general-purpose input. 


A write has no effect. 


Tx =0Oor1. 


12.2.4 Receive/Transmit Timer-Control Register 


A 32-bit receive/transmit timer-control register contains the control bits for the 
timer module. At reset, all bits are set to 0. Figure 12-16 shows the register. 
Bits 5-0 control the transmitter timer. Bits 11-6 control the receiver timer. The 
serial port receive/transmit timer function is similar to timer module operation. 
It can be considered a 16-bit-wide timer. Table 12-5 describes the register 
bits, bit names, and bit functions. 


Figure 12-16. Receive/Transmit Timer-Control Register 


RCLKSRC | Rc/P $ RHLD 


, a R R/W R/W ‘ R/W R/W R 7 R/W R/W R/W R/W 


Notes: 1) R = read, W=write 


2) xx = reserved bit, read as 0 


Table 12-5. Receive/Transmit Timer-Control Register Register Bits Summary 


Reset 
Abbreviation Value Name Function 
XGO 0 Transmit timer counter Resets and restarts the transmit timer counter. 
feta lf XGO = 1 and the timer is not held, the counter is zeroed 
and begins incrementing on the next rising edge of the timer 
input clock. 
The XGO bit is cleared on the same rising edge. Writing 0 
to XGO has no effect on the transmit timer. 
XHLD 0 Transmit counter hold If XHLD = 0, the counter is disabled and held in its current 
signal state. 


If XHLD = 1, the internal divide-by-two counter is also held so 
that the counter continues where it left off. 
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Table 12-5. Receive/Transmit Timer-Control Register Register Bits Summary (Continued) 


Reset 
Abbreviation Value 
Xc/P 0 
XCLKSRC 0 
XTSTAT 0 
RGO 0 
RHLD 0 
RC/P 0 
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Name 


Transmit clock/pulse 
mode control 


Transmit clock source 


Transmit timer status 


Receive timer counter 
restart 


Receive counter hold 
signal 


Rclock/pulse mode 
control 


Function 


When XC/P = 1, the clock mode is chosen. The signaling of 
the status flag and external output has a 50 percent duty 
cycle. 


When XC/P = 0, the status flag and external output are active 
for one CLKOUT cycle during each timer period. 


Specifies the source of the transmit timer clock. 


When XCLKSRC = 1, an internal clock with frequency equal 
to one-half the CLKOUT frequency is used to increment the 
counter. 


When XCLKSRC = 0, you can use an external signal from 
the CLKX pin to increment the counter. 


The external clock source is synchronized internally, thus 
allowing for external asynchronous clock sources that do 
not exceed the specified maximum allowable external clock 
frequency, that is, less than f(H1)/2.6. 


Indicates the status of the transmit timer. It tracks what 
would be the output of the uninverted CLKX pin. 


This flag sets a CPU interrupt on a transition from 0 to 1.A 
write has no effect. 


Resets and starts the receive timer counter. 


When RGO is set to 1 and the timer is not held, the counter 
is zeroed and begins incrementing on the next rising edge 
of the timer input clock. 


The RGO bit is cleared on the same rising edge. Writing 0 
to RGO has no effect on the receive timer. 


If RHLD = 0, the counter is disabled and held in its current 
state. 


If RHLD = 1, the internal divide-by-2 counter is also held so 
that the counter will continue where it left off. 


You can read and modify the timer registers while the timer 
is being held. RESET has priority over RHLD. 


When RG/P = 1, the clock mode is chosen. The signaling of 
the status flag and external output has a 50% duty cycle. 


When RC/P = 0, the status flag and external output are active 
for one CLKOUT cycle during each timer period. 
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Table 12-5. Receive/Transmit Timer-Control Register Register Bits Summary (Continued) 


Reset 
Abbreviation Value Name 
RCLKSRC 0 Receive timer clock 
source 
RTSTAT 0 Receive timer status 


Function 
Specifies the source of the receive timer clock. 


When RCLKSRC = 1, an internal clock with frequency equal 
to one-half the CLKOUT frequency is used to increment the 
counter. 


When RCLKSRC = 0, you can use an external signal from 
the CLKR pin to increment the counter. The external clock 
source is synchronized internally, allowing for external 
asynchronous clock sources that do not exceed the speci- 
fied maximum allowable external clock frequency (that is, 
less than f(H1)/2.6). 


Indicates the status of the receive timer. It tracks what would 
be the output of the uninverted CLKR pin. 


This flag sets a CPU interrupt on a transition from 0 to 1.A 
write has no effect. 


12.2.5 Receive/Transmit Timer-Counter Register 


The receive/transmit timer-counter register is a 32-bit register (see 
Figure 12-17). Bits 15-0 are the transmit timer-counter, and bits 31-16 are 
the receive timer-counter. Each counter is cleared to 0 whenever it increments 
to the value of the period register (see Section 12.2.6). /t is also set to O at reset. 


Figure 12-17. Receive/Transmit Timer-Counter Register 


31 


16 


Receive counter 


15 


Transmit counter 


Note: All bits are read/write. 
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12.2.6 Receive/Transmit Timer-Period Register 


The receive/transmit timer-period register is a 32-bit register (see Figure 12-18). 
Bits 15-0 are the timer transmit period, and bits 31-16 are the receive period. 
Each register specifies the period of the timer and is cleared to 0 at reset. 


Figure 12-18. Receive/Transmit Timer-Period Register 
31 16 


Receive period 


Transmit period 


Note: All bits are read/write. 


12.2.7 Data-Transmit Register 


When the data-transmit register (DXR) is loaded, the transmitter loads the word 
into the transmit-shift register (XSR), and the bits are shifted out. The delay from 
a write to DXR until an FSX occurs (or can be accepted) is two CLKX cycles. 
The word is not loaded into the shift register until the shifter is empty. When DXR 
is loaded into XSR, the XRDY bit is set, specifying that the buffer is available 
to receive the next word. Four tap points within the transmit-shift register are 
used to transmit the word. These tap points correspond to the four-data word 
sizes and are illustrated in Figure 12-19. The shift is a left-shift (LSB to MSB) 
with the data shifted out of the MSB corresponding to the appropriate tap point. 


Figure 12-19. Transmit Buffer Shift Operation 
< Shift direction — 
31 24 23 16 15 8 7 0 


32-bit word tap 24-bit word tap 16-bit word tap 8-bit word tap 


12.2.8 Data-Receive Register 


When serial data is input, the receiver shifts the bits into the receive-shift register 
(RSR). When the specified number of bits are shifted in, the data-receive register 
(DRR) is loaded from RSR, and the RRDY status bit is set. The receiver is double- 
buffered. If the DRR has not been read and the RSR is full, the receiver is frozen. 
New data coming into the DR pin is ignored. The receive shifter does not write over 
the DRR. The DRR must be read to allow new data in the RSR to be transferred 
to the DRR. When a write to DRR occurs at the same time that an RSR-to-DRR 
transfer takes place, the RSR-to-DRR transfer has priority. 
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Data is shifted to the left (LSB to MSB). Figure 12-20 illustrates what happens 
when words less than 32 bits are shifted into the serial port. In this figure, it is 
assumed that an 8-bit word is being received and that the upper three bytes 
of the receive buffer are originally undefined. In the first portion of the figure, 
byte ahas been shifted in. When byte bis shifted in, byte ais shifted to the left. 
When the data-receive register is read, both bytes a and bare read. 


Figure 12-20. Receive Buffer Shift Operation 


< Shift direction — 


31 24 23 16 15 8 7 0 
After byte a xX x xX a 


After byte b xX xX a b 


12.2.9 Serial-Port Operation Configurations 


Several configurations are provided for the operation of the serial-port clocks 
and timer. The clocks for each serial port can originate either internally or exter- 
nally. Figure 12-21 shows serial-port clocking in the I/O mode (CLKXFUNC 
= 0) when CLKx is either an input or an output. Figure 12-22 shows clocking 
in the serial-port mode (CLKXFUNC=1). Both figures use a transmit section 
for an example. The same relationship holds for a receive section. 
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Figure 12-21. Serial-Port Clocking in I/O Mode 


CLKX FUNC= 0 (I/O mode) 
CLKX I/O = 1 (CLKX, an output) 
XCLK SRC = 1 (internal CLK for timer) 


(a) 
Internal | External 


Internal ! 


: CLKX 
XSR 


DATOUT —>—— 
DATIN 


CLKX FUNC= 0 (I/O mode) 
CLKX I/O. = 0 (CLKX, an input) 
XCLK SRC = 1 (internal CLK for timer) 


(c) 


Internal | External 


TSTAT —— Internal | 
<—EBiimer itt += cick | 


DATOUT (NC) —° 
DATIN 
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CLKX FUNC= 0 (I/O mode) 
CLKX I/O =1 (CLKX, an output) 
XCLK SRC = 0 (external CLK for timer) 


(b) 
— External 


TSTAT 


DATAOUT—»—__ 
DATIN 


CLKX 


CLKX FUNC= 0 (I/O mode) 
CLKX I/O = 0 (CLKX, an input) 
XCLK SRC = 0 (external CLK for timer) 


(d) 
Internal | External 
TSTAT, 
| 
XSR | 


DATOUT (NC) —9 
DATIN 


CLKX 


Figure 12-22. Serial-Port Clocking in Serial-Port Mode 
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CLKX FUNC= 1 (serial-port mode) CLKX FUNC= 1 (serial-port mode) 
CLKX I/O = 1 (output serial-port CLK) CLKX I/O = 0 (input serial-port CLK) 
XCLK SRC =Oor1 XCLK SRC = 1 (internal CLK for timer) 

(a) (b) 
Internal) External eee External 
TSTAT — Internal TSTAT— Internal 
clock : clock : 
CLKX 
Be SR Ct ou 
DATOUT (NC) ——o INV DATOUT (NC) ——o 
DATIN DATIN INV 


CLKX FUNC= 1 (serial-port mode) 
CLKX I/O = 0 (input serial-port CLK) 
XCLK SRC =0 (external CLK for timer) 


(c) 
Internal | External 
| 
TSTAT 


| 
tay CLKX 


XSR 
DATOUT (NC) ——o INV 
DATIN-—— 


12.2.10 Serial-Port Timing 


The formula for calculating the frequency of the serial-port clock with an inter- 
nally generated clock depends upon the operation mode of the serial-port 


timers, defined as: 


f (pulse mode) = f (timer clock)/period register 
f (clock mode) = f (timer clock)/(2 x period register) 


An internally generated clock source f (timer clock) has a maximum frequency 
of f(H1)/2. An externally generated serial-port clock f (timer clock) (CLKX or 
CLKR) has a maximum frequency of less than f(H1)/2.6. See section 12.1.5 
on page 12-7 for information on timer pulse/clock generation. 


Transmit data is clocked out on the rising edge of the selected serial-port clock. 
Receive data is latched into the receive-shift register on the falling edge of the 
serial-port clock. All data is received MSB first and shifted to the left. If fewer than 
32 bits are received, the data received is right-justified in the receive buffer. 
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The transmit ready (XRDY) signal specifies that the data-transmit register 
(DXR) is available to be loaded with new data. XRDY goes active as soon as 
the data is loaded into the transmit-shift register (XSR). The last word may still 
be shifting out when XRDY goes active. If DXR is loaded before the last word 
has completed transmission, the data bits transmitted are consecutive; that is, 
the LSB of the first word immediately precedes the MSB of the second, with 
all signaling valid as in two separate transmits. XRDY goes inactive when DXR 
is loaded and remains inactive until the data is loaded into the shifter. 


The receive ready (RRDY) signal is active as long as a new word of data is 
loaded into the data-receive register and has not been read. As soon as the 
data is read, the RRDY bit is turned off. 


When FSxX is specified as an output, the activity of the signal is determined by 
the internal state of the serial port. If a fixed data rate is specified, FSX goes 
active when DXR is loaded into XSR. One serial-clock cycle later FSX turns 
inactive and data transmission begins. If a variable data rate is specified, the 
FSX pin is activated when the data transmission begins and remains active 
during the entire transmission of the word. Again, the data is transmitted one 
clock cycle after it is loaded into the data-transmit register. 


An input FSX in the fixed data-rate mode must go active for at least one serial- 
clock cycle and then inactive to initiate the data transfer. The transmitter then 
sends the number of bits specified by the XLEN (bit field 19, serial-port global- 
control register) bits. In the variable data-rate mode, the transmitter begins 
sending from the time FSX goes active until the number of specified bits have 
been shifted out. In the variable data-rate mode, when the FSX status changes 
prior to all the data bits being shifted out, the transmission completes, and the 
DX pin is placed in a high-impedance state. An FSR input is exactly comple- 
mentary to the FSX. 


When using an external FSX, if DXR and XSR are empty, a write to DXR results 
in a DXR-to-XSR transfer. This data is held in the XSR until an FSX occurs. 
When the external FSX is received, the XSR begins shifting the data. If XSR is 
waiting for the external FSX, a write to DXR changes DXR, but a DXR-to-XSR 
transfer does not occur. XSR begins shifting when the external FSX is received, 
or when it is reset using XRESET. 
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12.2.10.1 Continuous Transmit and Receive Modes 


When you choose continuous mode, consecutive writes do not generate or 
expect new sync pulse signaling. Only the first word of a block begins with 
an active synchronization. Thereafter, data is transmitted as long as new 
data is loaded into DXR before the last word has been transmitted. As soon 
as TXRDY is active and all of the data has been transmitted out of the shift 
register, the DX pin is placed in a high-impedance state, and a subsequent 
write to DXR initiates a new block and a new FSX. 


Similarly with FSR, the receiver continues shifting in new data and loading 
DRR. If the data-receive buffer is not read before the next word is shifted in, 
you will lose subsequent incoming data. You can use the RFSM bit to terminate 
the receive-continuous mode. 


12.2.10.2 Handshake Mode 


The handshake mode (HS = 1) allows for direct connection between processors. 
In this mode, all data words are transmitted with a leading 1 (see Figure 12-23). 
For example, in order to transmit an 8-bit word, the first bit sent is a 1, followed 
by the 8-bit data word. 


Once the serial port transmits a word in this mode, it does not transmit another 
word until it receives a separately transmitted 0 bit. Therefore, the 1 bit that 
precedes every data word is a request bit. 


Figure 12-23. Data Word Format in Handshake Mode 


 — Data word (8 bits) | 
| | 


DX | 


Leading 1 


After a serial port receives a word that has been read from the DRR (with the 
leading 1), the receiving serial port sends a single 0 to the transmitting serial 
port. The single 0 bit acts as an acknowledge bit (see Figure 12-24). This 
single acknowledge bit is sent every time the DRR is read, even if the DRR 
does not contain new data. 


Figure 12-24. Single 0 Sent as an Acknowleage Bit 


—— 
S) 


ingle 0 
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When the serial port is placed in the handshake mode, the insertion and deletion 
of a leading 1 for transmitted data, the sending of a 0 for acknowledgement of 
received data, and the waiting for this acknowledge bit are all performed auto- 
matically. Using this scheme, it is simple to connect processors with no external 
hardware and to guarantee secure communication. Figure 12-25 is a typical 
configuration. 


In the handshake mode, FSX is automatically configured as an output. Con- 
tinuous mode is automatically disabled. After a system reset or XRESET, the 
transmitter is always permitted to transmit. The transmitter and receiver must 
be reset when entering the handshake mode. 


Figure 12-25. Direct Connection Using Handshake Mode 


12.2.11 Serial-Port Interrupt Sources 
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A serial port has the following interrupt sources: 


Lj) Transmit-timer interrupt. The rising edge of XTSTAT causes a single- 
cycle interrupt pulse to occur. When XTINT is 0, this interrupt pulse is 
disabled. 


Lj Receive-timer interrupt. The rising edge of RTSTAT causes a single- 
cycle interrupt pulse to occur. When RTINT is 0, this interrupt pulse is 
disabled. 


(1 Transmitier-interrupt. Occurs immediately following a DXR-to-XSR 
transfer. The transmitter interrupt is a single-cycle pulse. When the serial- 
port global-control register bit XINT is 0, this interrupt pulse is disabled. 


L] Receiver-interrupt. Occurs immediately following an RSR-to-DRR trans- 
fer. The receiver interrupt is a single-cycle pulse. When the serial-port 
global-control register bit RINT is 0, this interrupt pulse is disabled. 


The transmit-timer interrupt pulse is ORed with the transmitter interrupt pulse 
to create the CPU-transmit interrupt flag XINT. The receive-timer interrupt pulse 
is ORed with the receiver interrupt pulse to create the CPU receive-interrupt flag 
RINT. 
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12.2.12 Serial-Port Functional Operation 


The following paragraphs and figures illustrate the functional timing of the 
various serial-port modes of operation. The timing descriptions are presented 
with the assumption that all signal polarities are configured to be positive (that 
is, CLKXP = CLKRP = DXP = DRP = FSXP = FSRP = 0). Logical timing, in situ- 
ations where one or more of these polarities are inverted, is the same except 
with respect to the opposite polarity reference points (that is, rising vs. falling 
edges, etc.). 


These discussions pertain to the numerous operating modes and configurations 
of the serial-port logic. When it is necessary to switch operating modes or 
change configurations of the serial port, you should do so only when XRESET 
or RRESET are asserted (low), as appropriate. When transmit configurations 
are modified, XRESET should be low, and when receive configurations are 
modified, RRESET should be low. When you use handshake mode, however, 
since the transmitter and receiver are interrelated, you should make any configu- 
ration changes with XRESET and RRESET both low. 


All of the serial-port operating configurations can be classified in two categories: 
fixed data-rate timing and variable data-rate timing. Both categories support 
operation in either burst or continuous mode. 


Burst-mode operation with variable data-rate timing is similar to burst-mode 
operation with fixed data-rate timing. With variable data-rate timing, however, 
FSX/R and data timing differ slightly at the beginning and end of transfers. 
Specifically, there are three major differences between fixed and variable data- 
rate timing: 


LJ FSX/R pulses typically last for the entire transfer interval in variable data- 
rate timing operation, although FSR and external FSX are ignored after 
the first bit transferred. FSX/R pulses in fixed data-rate mode typically last 
only one CLKX/R cycle but can last longer. 


(J With variable data-rate timing, data transfer begins during the CLKX/R 
cycle in which FSX/R occurs. With fixed data-rate timing, data transfer be- 
gins in the CLKX/R cycle following FSX/R. 


(J With variable data-rate timing, frame sync inputs are ignored until the end 
of the last bit transferred. With fixed data-rate timing, frame sync inputs are 


ignored until the beginning of the last bit transferred. 


The following paragraphs discuss fixed and variable data-rate operation and all 
of their variations. 
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12.2.12.1 Fixed Data-Rate Timing Operation 


Fixed data-rate serial-port transfers can occur in two varieties: burst mode and 
continuous mode. In burst mode, transfers of single words are separated by 
periods of inactivity on the serial port. In continuous mode, there are no gaps 
between successive word transfers; the first bit of a new word is transferred 
on the next CLKX/R pulse following the last bit of the previous word. This occurs 
continuously until the process is terminated. The following variations are 
included in fixed data-rate timing operations. 


(J Fixed Burst Mode 


In burst mode with fixed data-rate timing, FSX/FSR pulses initiate transfers, 
and each transfer involves a single word. With an internally generated FSX 
(see Figure 12-26), transmission is initiated by loading DXR. In this mode, 
there is a delay of approximately 2.5 CLKX cycles (depending on CLKX and 
H1 frequencies) from the time DXR is loaded until FSX occurs. With an 
external FSX, the FSX pulse initiates the transfer, and the 2.5-cycle delay 
effectively becomes a setup requirement for loading DXR with respect to 
FSX. In this case, you must load DXR no later than three CLKX cycles before 
FSX occurs. Once the XSR is loaded from the DXR, an XINT is generated. 


Figure 12-26. Fixed Burst Mode 


FSR/FSX (external) 


CLKX/R J l J l j l j l J l J l j l J | j | j l R/XVAREN = 0 
R/XFSM = 0 


WAISeananeetne 
RK 


FSX (internal) J l 


12-36 


SS 
DXR loaded XINT RINT 


In receive operations, once a transfer is initiated, FSR is ignored until the 
last bit. For burst-mode transfers, FSR must be low during the last bit, or 
another transfer will be initiated. After a full word has been received and 
transferred to the DRR, an RINT is generated. 


(1 Fixed Standard Mode 


In fixed data-rate mode, you can perform continuous transfers even if 
R/XFSM = 0, as long as properly timed frame synchronization is provided, 
or as long as DXR is reloaded each cycle with an internally generated FSX 
(see Figure 12-27). 
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Figure 12-27. Fixed Standard Mode With Back-to-Back Frame Sync 
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For receive operations and with externally generated FSX, once transfers 
have begun, frame sync pulses are required only during the last bit trans- 
ferred to initiate another contiguous transfer. Otherwise, frame sync inputs 
are ignored. Continuous transfers occur if the frame sync is held high. With 
an internally generated FSX, there is a delay of approximately 2.5 CLKX 
cycles from the time DXR is loaded until FSX occurs. This delay occurs 
each time DXR is loaded; therefore, during continuous transmission, the 
instruction that loads DXR must be executed by the N-3 bit for an N-bit 
transmission. Since delays due to pipelining vary, you should incorporate 
a conservative margin of safety in allowing for this delay. 


Once the process begins, an XINT and an RINT are generated at the begin- 
ning of each transfer. The XINT indicates that the XSR has been loaded from 
DXR and can be used to cause DXR to be reloaded. To maintain continuous 
transmission in fixed rate mode with frame sync, especially with an internally 
generated FSX, DXR must be reloaded early in the ongoing transfer. 


The RINT indicates that a full word has been received and transferred into 
the DRR; RINT indicates an appropriate time to read DRR. 


Continuous transfers are terminated by discontinuing frame sync pulses 
or, in the case of an internally-generated FSX, not reloading DXR. 


Fixed Continuous Mode 


You can accomplish continuous serial-port transfers, without the use of 
frame sync pulses, if R/XFSM is set to 1. In this mode, operation of the serial 
port is similar to continuous operation with frame sync, except that a frame 
sync pulse is involved only in the first word transferred, and no further frame 
sync pulses are used. Following the first word transferred (see 
Figure 12-28), no internal frame sync pulses are generated, and frame 
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sync inputs are ignored. Additionally, you should set R/XFSM prior to or 
during the first word transferred; you must set R/XFSM no later than the 
transfer of the N—1 bit of the first word, except for transmit operations. For 
transmit operations in the fixed data-rate mode, XFSM must be set no later 
than the N-2 bit. You must clear R/XFSM no later than the N-1 bit to be 
recognized in the current cycle. 


Figure 12-28. Fixed Continuous Mode Without Frame Sync 
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The timing of RINT and XINT and data transfers to and from DXR and 
DRR, respectively, are the same as in fixed data-rate standard mode with 
back-to-back frame syncs. This mode of operation also exhibits the same 
delay of 2.5 CLKX cycles after DXR is loaded before an internal FSX is 
generated. As in the case of continuous operation in fixed data-rate mode 
with frame sync, you must reload DXR no later than transmission of the 
N-3 bit. 


Enabling or Disabling Frame Syncs in Fixed Mode 


When you use continuous operation in fixed data-rate mode, you can set 
and clear R/XFSM as desired, even during active transfers, to enable or 
disable the use of frame sync pulses as dictated by system requirements. 
Under most conditions, changing the state of R/XFSM occurs during the 
transfer in which the R/KFSM change was made, provided the change 
was made early enough in the transfer. For transmit operations with inter- 
nal FSX in fixed data-rate mode, however, a 1-word delay occurs before 
frame sync pulse generation resumes when clearing XFSM to 0 (see 
Figure 12-29). In this case, one additional word is transferred before the 
next FSX pulse is generated. The clearing of XFSM is recognized during 
the transmission of the word currently being transmitted as long as XFSM 
is cleared no later than the N—1 bit. The setting of XFSM is recognized as 
long as XFSM is set no later than the N-2 bit. 
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Figure 12-29. Exiting Fixed Continuous Mode Without Frame Sync, FSX Internal 


1st word 2nd word 3rd word Ath word 5th word 


ou LLLP LPL Pe 


FSX 
(internal) __ | rs rn LJ Lf |__| 


DX ——————— «BNK CX CNX D1) XBOKEDG XERXES XEN 


[| ot 


LOAD DXR SET XFSM RESET XFSM 


12.2.12.2 Variable Data-Rate Timing Operation 
The following variations are included in variable data-rate timing operations. 
_] Variable Burst Mode 


In burst mode with variable data-rate timing, FSX/FSR pulse lasts for the 
entire duration of transfer. With an internally generated FSX (see 
Figure 12-30), transmission is initiated by loading DXR. In this mode there 
is a delay of approximately 3.5 CLKX cycles (depending on CLKX and H1 
freqency) from the time DXR is loaded until FSX occurs. With an external 
FSX, the FSX pulse initiates the transfer and the 3.5-cycle delay effectively 
becomes a setup requirement for loading DXR with respect to FSX. There- 
fore, in this case, you must load DXR no later than four CLKX cycles before 
FSX occurs. Once the XSR is loaded from the DXR, an XINT is generated. 


Figure 12-30. Variable Burst Mode 
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Figure 12-31. 
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(j Variable Standard Mode 


When you transmit continuously in variable data-rate mode with frame sync, 
timing is the same as for fixed data-rate mode, except for the differences 
between these two modes as described in Section 12.2.12 Serial-Port 
Functional Operation, on page 12-35. The only other exception is that you 
must reload DXR no later than the N-4 bit to maintain continuous opera- 
tion of the variable data-rate mode (see Figure 12-31); you must reload 
DXR no later than the N-3 bit to maintain continuous operation of the fixed 
data-rate mode. 


Variable Standard Mode With Back-to-Back Frame Syncs 
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Continuous operation in variable data-rate mode without frame sync (see 
Figure 12-32) is similar to continuous operation without frame sync in 
fixed data-rate mode (see Figure 12-28). As with variable data-rate stan- 
dard mode with back-to-back frame syncs, you must reload DXR no later 
than the N-4 bit to maintain continuous operation. Additionally, when 
R/XFSM is set or cleared in the variable data-rate mode, you must make 
the modification no later than the N—1 bit for the result to be affected in the 
current transfer. 


Figure 12-32. Variable Continuous Mode Without Frame Sync 
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12.2.13 Serial-Port Initialization/Reconfiguration 


Serial Ports 


R/XVAREN = 1 
R/XFSM = 1 


The serial ports are controlled through memory-mapped registers on the dedi- 
cated peripheral bus. A general procedure for initializing and/or reconfiguring 


the serial ports follows. 


1) Halt the serial port by clearing the XRESET and/or RRESET bits of the serial- 
port global-control register. To do this, write a 0 to the serial-port global- 


control register. The serial ports are halted on RESET. 


2) Configure the serial port via the serial-port global-control register (with 
XRESET = RRESET = 0) and the FSX/DX/CLKX and FSR/DR/CLKR port- 
control registers. If necessary, configure the receive/transmit registers; 
timer control (with XHLD = RHLD = 0), timer counter, and timer period. 


Refer to section 12.2.14 for more information. 


3) Start the serial-port operation by setting the XRESET and RRESET bits 
of the serial-port global-control register and the XHLD and RHLD bits of 
the serial-port receive/transmit timer-control register, if necessary. 


12.2.14 TMS320C3x Serial-Port Interface Examples 


In addition to the examples presented in this section, you can find DMA/serial 


port initialization examples in Example 12-9 and Example 12- 


12-78 and 12-79, respectively. 


10 on pages 
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12.2.14.1_ Handshake Mode Example 


When using the handshake mode, the transmit (FSX/DS/CLKX) and receive 
(FSR/DR/CLKR) signals transmit and receive data, respectively. Even if the 
’C8x serial port is receiving data only with handshake mode, the transmit signals 
are still needed to transmit the acknowledge signal. Example 12-3 shows the 
serial-port register setup for the ’C3x serial-port handshake communication, as 
shown in Figure 12-25 on page 12-34. 


Example 12-3. Serial-Port Register Setup #1 


Global control 


Transmit port control 
Receive port control 
S port timer control 
S port timer count 

S port timer period 


011x0x0xxxx00000000xx01100100b, 
0111h 


Vio ow 
fo) 
ra 
sy 


(if two C3xs have the same system clock). 


Note: x= user-configurable 
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Since FSX is set as an output and continuous mode is disabled when hand- 
shake mode is selected, follow these steps: 


1) Set the XFSM and RFSM bits to 0 and the FSXOUT bit to 1 in the global- 
control register. 


2) Set the XRESET, RRESET, and HS bits to 1 in order to start the handshake 
communication. 


3) Set the polarity of the serial-port pins active (high) for simplification. 


4) Although the CLKX/CLKR can be set as either input or output, set the 
CLKX as output and the CLKR as input. 


The rest of the bits are user-configurable as long as both serial ports have 
consistent setup. 


You need the serial-port timer only if the CLKX or CLKR is configured as an 
output. Since only the CLKX is configured as an output, set the timer control 
register to OFh. When you use the serial-port timer, set the serial timer register 
to the proper value for the clock speed. The serial-port timer clock speed setup 
is similar to the ’'C3x timer. Refer to Section 12.1, Timers, on page 12-2 for 
detailed information on timer clock generation. 


The maximum clock frequency for serial transfers is f(CLKIN)/4 if the internal 
clock is used and f(CLKIN)/5.2 if an external clock is used. If two ’C3xs have 
the same system clock, the timer-period register should be set equal to or 
greater than 1, which makes the clock frequency equal to f(CLKIN)/8. 
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Example 12-4 and Example 12-5 are serial-port register setups for the above 
case. (Assume two ’C3xs have the same system clock.) 


Example 12-4. Serial-Port Register Setup #1 


Global control OEBCO064h; 32 bits, fixed data rate, burst mode, 
Transmit port control O111h ; FSX (output), CLKX (output) = F(CLKIN) /8 
Receive port control O111lh ; CLKR (input), handshake mode, transmit 


S port timer 
S port timer 
S port timer 


control 
count 
period 


Vio uo ou ou 


OFh; and receive interrupt is enabled. 
Oh 
Olh 


Example 12-5. Serial-Port Register Setup #2 


Global control = 0C000364h; 8 bits, variable data rate, burst mode, 
Transmit port control = 0111h; FSX (output), CLKX (output) = f(CLKIN) /24 
Receive port control = O111h ; CLKR (input), handshake mode, transmit 

S port timer control = OFh; and receive interrupt is disabled. 

S port timer count = Oh 

S port timer period 2 Olh 


Since the data has a leading 1 and the acknowledge signal is a 0 in the hand- 
shake mode, the ’C3x serial port can distinguish between the data and the 
acknowledge signal. Even if the ’C3x serial port receives the data before the 
acknowledge signal, the data is not misinterpreted as the acknowledge signal 
and lost. Additionally, the acknowledge signal is not generated until the data 
is read from the data-receive register (DRR); the 'C3x does not transmit the 
data and the acknowledge signal simultaneously. 


12.2.14.2 CPU Transfer With Serial Port Transmit Polling Method 


Example 12-6 sets up the CPU to transfer data (128 words) from an array buffer 
to the serial port 0 output register when the previous value stored in the serial- 
port output register has been sent. Serial port 0 is initialized to transmit 32-bit 
data words with an internally generated frame sync and a bit-transfer rate of 
8H1 cycles/bit. 
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Example 12-6. CPU Transfer With Serial Port Transmit Polling Method 


* TITLE: CPU TRANSFER WITH SERIAL-PORT TRANSMIT POLLING METHOD 


* 


-GLOBAL START 


-DATA 
SOURCE -WORD _ARRAY 
-BSS _ARRAY,128 ; DATA ARRAY LOCATED IN .BSS SECTION 
; THE UNDERSCORE USED IS JUST TO MAKE IT 
; ACCESSIBLE FROM C (OPTIONAL) 
SPORT -WORD 808040H ; SERIAL-PORT GLOBAL CONTROL REG ADDRESS 
SPRESET .WORD 008C0044 ; SERIAL-PORT RESET 
SGCCTRL .WORD 048C0044H ; SERIAL-PORT GLOBAL CONTROL REG INITIALIZATION 
SXCTRL -WORD 111H ; SERIAL-PORT TX PORT CONTROL REG INITIALIZA- 
TION 
STCTRL -WORD OOFH ; SERIAL-PORT TIMER CONTROL REG INITIALIZATION 
STPERIOD .WORD 00000002h ; SERIAL-PORT TIMER PERIOD 
RESET -WORD OH ; SERIAL-PORT TIMER RESET VALUE 
. TEXT 
START LDP RESET ; LOAD DATA PAGE POINTER 
ANDN 10H, IE ; DISABLE SERIAL-PORT TRANSMIT INTERRUPT TO CPU 


* SERIAL PORT INITIALIZATION 


LDI @SPORT,AR1 
LDI @RESET, RO 
LDI 4,IRO 
STI RO,*+AR1 (IRO) ; SERIAL-PORT TIMER RESET 
LDI @SPRESET, RO 
STI RO, *AR1 ; SERIAL-PORT RESET 
LDI @SXCTRL, RO ; SERIAL-PORT TX CONTROL REG INITIALIZATON 
STI RO,*+AR1 (3) 
LDI @STPERIOD, RO ; SERIAL-PORT TIMER PERIOD INITIALIZATION 
STI RO, *+ARI1 (6) 
LDI @STCTRL,RO ; SERIAL-PORT TIMER CONTROL REG INITIALIZATION 
STI RO,*+AR1 (4) 
LDI @SGCCTRL, RO ; SERIAL-PORT GLOBAL CONTROL REG INITIALIZATION 
STI RO,*AR1L 
* CPU WRITES THE FIRST WORD 
LDI @SOURCE, ARO 
LDI *ARO++,R1 
STI R1,*+AR1 (8) 
* CPU WRITES 127 WORDS TO THE SERIAL PORT OUTPUT REG 


LDI 8,IRO 
LDI 2,R0 
LDI 126,RC 
RPTB LOOP 
WAIT AND *AR1,RO0,R2 ; WAIT UNTIL XRDY BIT = 1 
BZ WAIT 
LOOP STI R1,*+AR1(IRO) 
|| LDI *++ARO(1),R1 
BU $ 
. END 
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12.2.14.3, DMA Transfer With Serial Port Interrupt 


Example 12-8 and Example 12-9 of Section 12.3.11 on page 12-76 use the 
DMA synchronized to serial port interrupts to transfer data (128 words) from 
an array buffer to the serial portO output register. 


12.2.14.4 Serial Analog Interface Chips Interface Example 


The TLC320C4x analog interface chips (AIC) from Texas Instruments offer a 
zero-glue-logic interface to the ’C3x family of DSPs. The interface is shown in 
Figure 12-33 as an example of the ‘C8x serial-port configuration and operation. 


Figure 12-33. TMS320C3x Zero-Glue-Logic Interface to TLC320C4x Example 
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The ’C8x resets the AIC through the external pin XFO. It also generates the 
master clock for the AIC through the timer 0 output pin, TCLKO. (Precise selec- 
tion of a sample rate may require the use of an external oscillator rather than 
the TCLKO output to drive the AIC MCLK input.) In turn, the AIC generates the 
CLKRO and CLKXO shift clocks as well as the FSRO and FSX0 frame synchro- 
nization signals. 


A typical use of the AIC requires an 8-kHz sample rate of the analog signal. 
If the clock input frequency to the ’C3x device is 30 MHz, you should load the 
following values into the serial port and timer registers. 


Serial Port: 

Port global-control register 0E970300h 
FSX/DX/CLKX port-control register 00000111h 
FSR/DR/CLKR port-control register 00000111h 
Timer: 

Timer global-control register 000002C1h 
Timer-period register 00000001h 
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12.2.14.5 Serial Analog-to-Digital (A/D) and Digiial-to-Analog (D/A) Interface Example 


The DSP201/2 and DSP101/2 family of D/As and A/Ds from Burr Brown also 
offer a zero-glue-logic interface to the ‘C3x family of DSPs. The interface is 
shown in Example 12-7. This interface is used as an example of the ’C3x serial 
port configuration and operation. 


Example 12-7. TMS320C3x Zero-Glue-Logic Interface to Burr Brown A/D and D/A 


Burr Brown DSP102 A/D Burr Brown DSP202 D/A 


+3V 


+3V 


1 MQ 


12.29 MHz 


+--+ 
22 pF >5 Sx 22pF 
Vv Vv 


The DSP102 A/D is interfaced to the ’C3x serial-port receive side; the DSP202 
D/A is interfaced to the transmit side. The A/Ds and D/As are hard-wired to run 
in cascade mode. In this mode, when the ’C3x initiates a convert command to 
the A/D via the TCLKO pin, both analog inputs are converted into two 16-bit 
words, which are concatenated to form one 32-bit word. 


1) The A/D signals the ’C3x via the A/D’s SYNC signal (connected to the 
FSRO pin) that serial data is to be transmitted. 


2) The 32-bit word is then serially transmitted, MSB first, out the SOUTA serial 
pin of the DSP102 to the DRO pin of the ’C3x serial port. 


3) The ’C3x is programmed to drive the analog interface bit clock from the 
CLKXO pin of the ’C8x. 
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4) The bit clock drives both the A/D’s and D/A’s XCLK input. 


5) The ’C3x transmit clock also acts as the input clock on the receive side of 
the ’C3x serial port. 


6) Since the receive clock is synchronous to the internal clock of the ’C3x, the 
receive clock can run at full speed (that is, f(H1)/2). 


Similarly, on receiving a convert command, the pipelined D/A converts the last 
word received from the ’C3x and signals the ’C3x via the SYNC signal (Connected 
to the ’C3x FSX0 pin) to begin transmitting a 32-bit word representing the two 
channels of data to be converted. The data transmitted from the ’C3x DX0 pin is 
input to both the SINA and SINB inputs of the D/A as shown in Example 12-7. 


The ’C8x is set up to transfer bits at the maximum rate of about 8 Mbps, with a 
dual-channel sample rate of about 44.1 kHz. Assuming a 32-MHz CLKIN, you 
can configure this standard-mode fixed-data-rate signaling interface by setting 
the registers as described below: 


Serial Port: 

Port global-control register OEBC0040h 
FSX/DX/CLKX port-control register 00000111h 

FSR/DR/CLKR port-control register 00000111h 

Receive/transmit timer-control register OO00000Fh 
Timer: 

Timer global-control register 000002C1h 
Timer-period register 000000B5h 
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12.3 DMA Controller 


The DMA controller is a programmable peripheral that transfers blocks of data 
to any location in the memory map without interfering with CPU operation. The 
’C3x can interface to slow, external memories and peripherals without reducing 
throughput to the CPU. The ’C3x DMA controller features are: 


(J Transfers to and from anywhere in the processor's memory map. For 
example, transfers can be made to and from on-chip memory, off-chip 
memory, and on-chip serial ports. 


[1 One DMA channel for memory-to-memory transfers in ‘C30 and 
‘C31/VC33. Two DMA channels for memory-to-memory transfers in ‘C32. 


[J Concurrent CPU and DMA controller operation with DMA transfers at the 
same rate as the CPU (supported by separate internal DMA address and 
data buses). 


41 Source and destination-address registers with auto increment/decrement. 


J Synchronization of data transfers via external and internal interrupts. 


12.3.1 DMA Functional Description 
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The DMA controller supports one (‘C30 and ‘C31/VC33) or two (‘C32) DMA chan- 
nels that perform transfers to and from anywhere in the ‘C3x memory map. 


Each DMA channel is controlled by four registers that are mapped in the ‘C3x 
peripheral address space, as shown in Figure 12-35. The major DMA registers 
are described in Section 12.3.3. 


The DMA controller has dedicated on-chip address and data buses (see 
Figure 2-5 through Figure 2—7 on pages 2-14 through 2-16 for a block dia- 
gram of the peripherals of the ‘C3x). All accesses made by the DMA channels 
are arbitrated in the DMA controller and take place over these dedicated 
buses. The DMA channels transfer data in a sequential time-slice fashion, 
rather than simultaneously, because they share common buses. 


The DMA channels can run constantly or can be triggered by external (INT3-0) 
or internal (on-chip timers and serial ports) interrupts. 
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12.3.1.1 TMS320C30 and TMS320C31/VC33 DMA Controller ‘cn: ‘oat; 


The ’C30 and ’C31/VC33 have an on-chip direct memory access (DMA) con- 
troller that reduces the need for the CPU to perform input/output functions. The 
DMA controller can perform input/output operations without interfering with the 
operation of the CPU. Therefore, it is possible to interface the ‘C30 and 
°C31/VC33 to slow external memories and peripherals (A/Ds, serial ports, etc.) 
without reducing the computational throughput of the CPU. The result is im- 
proved system performance and decreased system cost. 


12.3.1.2 TMS320C32 Two-Channel DMA Controller 


The ’C32 has an improved DMA that supports two channels and configurable 
priorities. The next sections discuss the new features. 


The ’C32 has a two-channel (channel 0 and channel 1) DMA instead of a one- 
channel DMA as in the ’C30/’C31/VC33 devices. The ’C32’s DMA functions 
similarly to that of the °C30/C31/VC33 DMA but with the addition of DMA/CPU 
priority scheme and inter-DMA priority mode. Although the ‘C32 CPU supports 
both floating-point and integer data access with different data size from the ex- 
ternal memory, the ’'C32’s DMA transfer is strictly an integer data transfer. The 
integer data access of the C32 DMA is the same as the CPU integer data access 
— 32-bit internal and data size conversion at the external memory interface port. 
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12.3.2 DMA Basic Operation 
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If a block of data is to be transferred from one region in memory to another region 
in memory (as shown in Figure 12-34), the following sequence is performed: 


DMA Registers Initialization 


1) 


The source-address register of a DMA channel is loaded with the address 
of the memory location to read from. 


The destination-address register of the same DMA channel is loaded with 
the address of the memory location to write to. 


The transfer counter is loaded with the number of words to be transferred. 


The DMA channel control register is loaded with the appropriate modes 
to synchronize the DMA controller reads and writes with interrupts. 


DMA Start 


5) 


The DMA controller is started through the DMA START field in the DMA 
channel control register. 


Word Transfers 


6) 


7) 


The DMA channel reads a word from the source-address register and 
writes it to a temporary register within the DMA channel. 


After a read by the DMA channel, the source-address register is increm- 
ented, decremented, or unchanged depending on the INCSRC or 
DECSRC bit fields of DMA channel control register. 


After the read operation completes, the DMA channel writes the temporary 
register value to the destination-address pointed to by the destination- 
address register. 


After the destination-address has been fetched, the transfer-counter reg- 
ister is decremented and the destination-address register is incremented, 
decremented, or unchanged, depending on the INCDST or DECDST bit 
fields of the DMA channel control register. 


10) During every data write, the transfer counter is decremented. The block 


transfer terminates when the transfer counter reaches zero and the write 
of the last transfer is completed. The DMA channel sets the transfer-counter 
interrupt (TCINT) flag in the DMA channel control register. 
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After the completion of a block transfer, the DMA controller can be programmed 
to do several things: 


_) Stop until reprogrammed (TC = 1) 
_j) Continue transferring data (TC = 0) 


_) Generate an interrupt to signal the CPU that the block transfer is complete 
(TCINT = 1) 


The DMA can be stopped by setting the START bits to 00, 01, or 10. When the 
DMA is restarted (START = 11), it completes any pending transfer. 


Figure 12-34. DMA Basic Operation 


External or Internal 
memory 


DMA Memory pointed to by DMA 
channel source-address register 


Temporary register 


[> External or Internal 
memory 


Memory pointed to by DMA 
destination-address register 


12.3.3 DMA Registers 
Each DMA channel has four registers designated as follows: 


_) Control register: contains the status and mode information about the 
associated DMA channel 


_) Source-address register: contains the memory address of data to be 
read 


_j Destination-address register: contains the memory address where data 
is written 


_j Transfer-counter register: contains the block size to move 


After reset, the control register, the transfer counter, and the auxiliary transfer- 
counter registers are set to Os and the other registers are undefined. 


Figure 12-36 shows these registers for ‘C30 and ’C31/VC33. Figure 12-37 
shows these registers for ’C32. 


The format of the DMA-channel control register is shown in Figure 12-35. The 
text following the figure describes the functions of each field in the register. 
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At reset, each DMA-channel control register is set to 0. This makes the DMA 
channels lower-priority than the CPU, sets up the source address and destination 
address to be calculated through linear addressing, and configures the DMA 
channel in the unified mode. 


Figure 12-35. Memory-Mapped Locations for DMA Channels 
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Address Register 
808000h DMA 0 global control 


808004h DMA 0 source address 


808006h DMA 0 destination address 
808008h DMA 0 transfer counter 


808010h DMA 1 global controlt 


808014h DMA 1 source addresst 


808016h DMA 1 destination addresst 


808018h DMA 1 transfer countert 


tT 'C32 only 


12.3.3.1 DMA Global-Control Register 


DMA Controller 


The global-control register controls the state in which the DMA controller 
operates. This register also indicates the status of the DMA, which changes 
every cycle. Source and destination addresses can be incremented, decrem- 
ented, or synchronized using specified global-control register bits. At system 
reset, all bits in the DMA control register are cleared to 0. Figure 12-36 shows 
the global-control registers for the ‘C30 and ’C31/VC33 devices. Figure 12-37 
and Figure 12-38 show the global-control registers for the ‘C32. Table 12-6 
shows the register bits, bit names, and bit functions. 


Figure 12-36. TMS320C30 and TMS320C31/VC33 DMA Global-Control pegig Se 


31 15114 12) 44 10 9 a. % 6 5 4 2 2 1 0 
me Ne 'TCINT | TC | SYNC 4 DECDST| INCDST | DECSRC | INCSRC STAT START 
t 
i , RW RW RW RW R/W R/W RW +! OR R/W 
Notes: 1) R = read, W=write 
2) xx = reserved bit, read as 0 
Figure 12-37. TMS320C32 DMAO Global-Control Register 
31 15) 14 13 124 41 10 9 84 7 6 5 4,8 21 0 
'PRIORITY| DMAO # ' ' 
x 1 MODE PRI + TOINT | TC | SYNC DECDST | INCDST | DECSRC | INCSRC, STAT | START 
: RW RW 4 RW RW RW 4. RW R/W R/w RW {RR R/W 
Notes: 1) R = read, W=write 
2) xx = reserved bit, read as 0 
Figure 12-38. TMS320C32 DMA1 Global-Control Register 
31 15414 is 12% 1 10 9 8 7 6 5 4 12s 24 0 
: DMA1 ! : ' 
KF xx pai 'TCINT | TC | SYNC ! DECDST | INCDST | DECSRC Incsrc 1 STAT START 
‘ | | 4 
‘ RW 4, RW RW RW . RW R/W R/wW Rw {oR R/W 
Notes: 1) R = read, W=write 
2) xx = reserved bit, read as 0 
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Table 12-6. DMA Global-Control Register Bits Summary 


Abbreviation 


START 


STAT 
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Reset 
Value 


00 


00 


Name 


DMA start control 


DMA status 


Description 


Controls the state in which the DMA starts and stops. The 
DMA may be stopped without any loss of data. 


The following table summarizes the START bits and DMA 
operation: 


Bit1 BitO Function 


0 0 DMA read or write cycles in progress are 
completed; any data read is ignored. Any 
pending read or write is cancelled. The 
DMA is reset so that when it starts, a new 
transaction begins; thatis, a read is per- 
formed (Reset value). 


0 1 If aread or write has begun, it is completed 
before it stops. If a read or write has not 
begun, no read or write is started. 


1 0 If a DMA transfer has begun, the entire 
transfer is complete (including both read 
and write operations) before stopping. If a 
transfer has not begun, none is started. 


1 1 DMA starts from reset or restarts from the 
previous state. 


When the DMA completes a transfer, the START bits remain 
in 11 (base 2). The DMA starts when the START bits are set 
to 11 and one of the following conditions applies: 

(J The transfer counter is set to a value different from Ox0. 
Lj The TC bit is set to 0. 


Indicates the status of the DMA and changes every cycle. 
The following table summarizes the STAT bits and DMA status. 
Bit3 Bit2 Function 


0 0 The DMA is being held between DMA 
transfer (between a write and a read). This 
is the value at reset. 


0 1 DMA is being held in the middle of a DMA 
transfer (between a read and a write). 


1 0 Reserved. 


1 1 DMA busy. DMA is performing a read or 
write or waiting for a source or destination 
synchronization interrupt. 
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Table 12-6. DMA Global-Control Register Bits Summary (Continued) 


Abbreviation 
INCSRC 


DECSRC 


INCDST 


DECDST 


SYNC 


TC 


TCINT 


Reset 
Value 


0 


0 


0 


Name 


DMA source address 
increment mode 


DMA source address 
decrement mode 


DMA destination 
address increment 
mode 


DMA destination 
address decrement 
mode 


DMA synchronization 
mode 


DMA transfer mode 


DMA transfer counter 
interrupt mode 


Description 

If INCSRC = 1, the source address is incremented after every 
read. 

If DECSRC = 1, the source address is decremented after 
every read. 

If INCSRC = DECSRGC, the source address is not modified 
after a read. 


If INCDST = 1, the destination address is incremented after 
every write. 


lf DECDST = 1, the destination address is decremented after 
every write. 


If INCDST = DECDST, the destination address is not modified 
after a write. 

Determines the timing synchronization between the events 
initiating the source and destination transfers. 


The following table summarizes the SYNC bits and DMA 
synchronization. 


Bit9 Bit8 Function 

0 0 No synchronization. Enabled interrupts 
are ignored (reset value). 

0 1 Source synchronization. A read is per- 
formed when an enabled interrupt occurs. 

1 0 Destination synchronization. A write is per- 


formed when an enabled interrupt occurs. 


Source and destination synchronization. A 
read is performed when an enabled interrupt 
occurs. A write is then performed when the 
next enabled interrupt occurs. 


Affects the operation of the transfer counter. 


If TC = 0, transfers are not terminated when the transfer 
counter becomes 0. 


lf TC = 1, transfers are terminated when the transfer 
counter becomes 0. 


If TCINT = 1, the DMA interrupt is set when the transfer 
counter makes a transition to 0. 


If TCINT = 0, the DMA interrupt is not set when the transfer 
counter makes a transition to 0. 
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Table 12-6. DMA Global-Control Register Bits Summary (Continued) 


Reset 
Abbreviation Value Name Description 
Sout DMAO PRI 00 CPU/DMA channel 0 (on the DMAO control register) (C32 only) 


priority mode 
00 CPU/DMA channel 1 (on the DMA1 control register) (‘C32 only) 


pionmy meds Configures CPU/DMA controller priority. (Gee Section 12.3.6 
on page 12-64). 


The following table explains the DMA PRI bits and CPU/ 


DMA priorities. 

Bit13 Bit12 Function 

0 0 DMA has lower priority than the CPU 
access. If the DMA channel and the CPU 
are requesting the same resource, the 
CPU has priority (reset value). 

0 1 Reserved. 

1 0 Rotating arbitration, which sets priorities 


be tween the CPU and DMA channel by 
alternating their accesses (but not exactly 
equally). Priority rotates between the CPU 
and DMA accesses when they conflict 
during consecutive instruction cycles. 


1 1 DMA has higher priority than the CPU 
access. Ifthe DMA channel and the CPU 
are requesting the same resource, the 
DMA has priority. 


mn 
C29] PRIORITY 0 DMA channels priority If PRIORITY MODE = 0, fixed priority for the two DMA chan- 
MODE mode nels. DMA channel 0 always has priority over DMA channel 1. 


If priority mode = 1, rotating priority for the two DMA channels. 
DMA channel 0 has priority after the device is reset. After 
reset, the last channel serviced has the lowest priority. The ar- 
bitration is performed at DMA service boundaries, that is, 
after either a DMA read or DMA write. 


See Section 12.3.5 on page 12-62 for more information. 
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12.3.3.2 Destination-Address and Source-Address Registers 


The DMA destination-address and source-address registers are 24-bit registers 
whose contents specify destination and source addresses. As specified by 
control bits DECSRC, INCSRC, DECDST, and INCDST of the DMA global- 
control register, these registers are incremented, decremented, or remain 
unchanged at the end of the corresponding memory access; that is, the source 
register for a read and the destination register for a write (see Figure 12-39). 
On system reset, 0 is written to these registers. 


Figure 12-39. DMA Controller Address Generation 


DMA 0 1 -1 
address DMA source-address register INCSRC 
bus DECSRC 
DMA source-address 
generator 
DMA 0 1-1 
address DMA destination-address register INCDST 
bus DECDST 


DMA destination-address 
generator 
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12.3.3.3 Transfer-Counter Register 
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The transfer-counter register is a 24-bit register that contains the number of 
words to be transmitted. Figure 12-40 shows the transfer-counter operation. 
It is controlled by a 24-bit counter that decrements at the beginning of a DMA 
memory write. In this way, it can control the size of a block of data transferred. 
The transfer-counter register is set to 0 at system reset. When the TCINT bit 
of the DMA global-control register is set, the transfer-counter register causes 
a DMA interrupt flag to be set when 0 is reached. 


The counter is decremented after completing the destination-address fetch. 
The interrupt is generated after the transfer counter is decremented and after 
the completion of the write of the last transfer. 


The decrementer checks whether the transfer counter equals 0 after the decre- 
ment is performed. As a result, if the counter register has a value of 1, then the 
DMA channel can be halted after only one transfer is performed. Thus, by set- 
ting the transfer counter to 1, the DMA channel transfers the minimum possible 
number of words (1 time). The value of the transfer counter is treated as an un- 
signed integer. Transfers can be halted when a 0 value is detected after a decre- 
ment. If the DMA controller channel is not halted after the transfer reaches zero, 
the counter continues decrementing below 0. Thus, by setting the transfer 
counter to 0, the DMA channel transfers the maximum possible number of 
words (100 O000h times). 
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Figure 12-40. Transfer-Counter Operation 


Transfer-counter register 
Decrementer 


Compare 
to 0 
? 
Yes 
Yes 


DMA interrupt generated 


12.3.4 CPU/DMA Interrupt-Enable Register 


The CPU/DMA interrupt-enable register (IE) is a 32-bit register located in the 
CPU register file. The CPU interrupt-enable bits are in locations 10-1. The DMA 
interrupt-enable bits are in locations 26-16. A 1 in a CPU/DMA interrupt-enable 
register bit enables the corresponding interrupt. A 0 disables the corresponding 
interrupt. At reset, 0 is written to this register. 


Figure 12-41 shows the CPU/DMA interrupt-enable registers for the ‘C30 and 


‘C31/VC33. Figure 12-42 shows the CPU/DMA interrupt-enable register for the 
‘C32. Table 12-7 describes the register bits, bit names, and bit functions. 
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Figure 12-41. TMS320C30 and TMS320C31/VC33 CPU/DMA Scat 
Interrupt-Enable Register 
31 30 29 281 27 26 25 24 | 23 22 a4 20 } 19 18 17 16 
: EDINT | ETINT1 | ETINTO! ERINT1 | EXINT1 | ERINTO| EXINTO' EINT3 | EINT2 | EINT1 | EINTO 
7 ORY OT OK) (DMA) | (DMA) | (DMA) 1 (DMA) | (DMA) | (DMA) | (DMA) « (DMA) | (DMA) | (DMA) | (DMA) 
‘ R/W R/W Rw 1 RW R/W R/W RW. RW R/W R/W R/W 
15 14 13 12811 10 9 Ss . Fe 6 5 4 ' 3 2 1 0 
' EDINT | ETINT( | ETINTO' ERINT1 | EXINT1 | ERINTO| EXINTO' EINT3 | EINT2 | EINT1 | EINTO 
OK) OK) KE HUNKT (CPU) | (CPU) | (CPU) , (CPU) | (CPU) | (CPU) | (CPU) , (CPU) | (CPU) | (CPU) | (CPU) 
i] 
i] 


’ 
' R/W R/W R/W R/W R/W R/W R/IW « R/W R/W R/W R/W 


Notes: 1) R = read, W=write 
2) xx = reserved bit, read as 0 


Figure 12-42. TMS320C32 CPU/DMA Interrupt-Enable Register 


31 30 29 28 1 27 26 25 24 4% 23 22 21 20 19 18 ble 16 
4 ¢ 
EINT3 EINT2 EINT1 | EINTO \ EDINTO | EDINT1 | ETINT1] ETINTO® ETINT1 | ETINTO] ERINTO | EXINTOs EINT3 | EINT2 | EINT1 | EINTO 
(DMA1) | (DMA1) | (DMAt)| (OMAt) # (DMA1) | (DMAQ) | (DMAO)| (DMA)  (DMA1) | (DMA1) | (DMA1) | (DMAO) § (DMAO) | (DMA0) | (OMAo)| (DMA0) 
+ 
R/W R/W R/W R/W # RW R/W R/W R/W ' R/W R/W R/W R/W ; R/W R/W R/W R/W 
‘ ‘ ‘ 
15 14 13 12, 1 10 9 Bor ae 6 5 4 4 3 2 1 0 
xx x x xX ; EDINT1 | EDINTO ETINT1 ETINTO | oe ee ERINTO|} EXINTO i EINT3 | EINT2 | EINT1 EINTO 
4 (CPU) (CPU) (CPU) (CPU) ‘ (CPU) (CPU) E (CPU) | (CPU) | (CPU) (CPU) 
’ ‘ ‘ ‘ i 
1 R/W R/W R/W R/W ‘ R/W R/IW «a RW R/W R/W R/W 


Notes: 1) R = read, W=write 
2) xx = reserved bit, read as 0 
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Table 12-7. CPU/DMA Interrupt-Enable Register Bits 


Abbreviation 
EINTO (CPU 
EINT1 (CPU 
EINT2 (CPU 
EINT3 (CPU 
EXINTO (CPU 
ERINTO (CPU 
EXINT1 (CPU) 
ERINT1 (CPU) 
ETINTO (CPU 
ETINT1 (CPU 
EDINT (CPU) 


) 
) 
) 
) 
) 
) 
) 
) 


EDINTO (CPU) 
EDINT1 (CPU) 
EINTO (DMA) 


EINT1 (DMA) 
EINT2 (DMA) 
EINT3 (DMA) 


EINTO (DMAO 
EINT1 (DMAO 
EINT2 (DMAO 

(DMAO 
EXINTO (DMA) 


) 
) 
) 
) 


ERINTO (DMA) 


EXINT1 (DMA) 


Reset 


Value Description 


0 


a oO co Oo oo 2 oOo oOo OD © 


oO 


Oo oo oo © © 


CPU external interrupt 0 enable 

CPU external interrupt 1 enable 

CPU external interrupt 2 enable 

CPU external interrupt 3 enable 

CPU serial port 0 transmit interrupt enable 

CPU serial port 0 receive interrupt enable 

CPU serial port 1 transmit interrupt enable (’C30 only) 
CPU serial port 1 receive interrupt enable (’C30 only) 
CPU timer0 interrupt enable 

CPU timer interrupt enable 


CPU DMA controller 
°C31/VC33 only) 


CPU DMAO controller interrupt enable (’C32 only) 


interrupt enable (’'C30 and 


CPU DMA1 controller interrupt enable (’C32 only) 


DMA external interrupt 0 enable (C30 and ’C31/VC33 
only) 


DMA external interrupt 1 enable (’C30 and ’C31/VC33 
only) 


DMA external interrupt 2 enable (’C30 and ’C31/VC33 
only) 


DMA external interrupt 3 enable (C30 and ’C31/VC33 
only) 


DMAO external interrupt 0 enable (C32 only) 
DMAO external interrupt 1 enable (’C32 only) 
DMAO external interrupt 2 enable (’C32 only) 
DMAO external interrupt 3 enable (’C32 only) 


DMA serial port 0 transmit interrupt enable 
(‘C30 and ’C31/VC33 only) 


DMA serial port 0 receive interrupt enable 
(‘C30 and ’C31/VC33 only) 


DMA serial port 1 transmit interrupt enable (’C30 only) 
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Table 12-7. CPU/DMA Interrupt-Enable Register Bits (Continued) 


Reset 
Abbreviation Value Description 


ERINT1 (DMA) 0 DMA serial port 1 receive interrupt enable (’C30 only) 


( 
EXINTO (DMAO) 0 DMAO serial port 1 transmit interrupt enable (’C32 only) 
ERINTO (DMA1) 0 DMA‘1 serial port 1 receive interrupt enable (’C32 only) 
ETINTO (DMA) 0 DMA timer0 interrupt enable (C30 and ’C31/VC33) 
ETINT1 (DMA) 0 DMA timer1 interrupt enable (C30 and ’C31/VC33 only) 
ETINTO (DMAO) 0 DMAO timer1 interrupt enable (’C32 only) 
ETINT1 (DMAO) 0 DMAO timer1 interrupt enable (’C32 only) 
ETINTO (DMA1) 0 DMA1 timerO interrupt enable (’C32 only) 
ETINT1 (DMA1) 0 DMA‘1 timer1 interrupt enable (’C32 only) 
EDINT (DMA) 0 DMA controller interrupt enable (‘C30 and ’C31/VC33) 
EDINT1 (DMAO) 0 DMAO-DMA1 controller interrupt enable (’C32 only) 
EDINTO (DMA1) 0 DMA1-DMA0 controller interrupt enable (’C32 only) 
EINTO (DMA1) 0 DMA1 external interrupt 0 enable (’C32 only) 
EINT1 (DMA1) 0 DMA1 external interrupt 1 enable (’C32 only) 
EINT2 (DMA1) 0 DMA1 external interrupt 2 enable (’C32 only) 
EINT3 (DMA1) 0 DMA1 external interrupt 2 enable (’C32 only) 


12.3.5 TMS320C32 DMA Internal Priority Schemes 


Because all accesses made by the two DMA channels take place over one 
common internal DMA data and address bus, a priority scheme for bus arbitra- 
tion is required. Within the DMA controller, two priority schemes are used to 
designate which channel is serviced next: 


L) A fixed priority scheme with channel 0 always having the highest priority 
and channel 1 the lowest 


_j A rotating priority scheme that places the most recently serviced channel 
at the bottom of the priority list (default setup after reset) 


12.3.5.1 Fixed Priority Scheme 


This scheme provides a fixed (unchanging) priority for each channel as follows: 
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Priority Channel 
Highest 0 
Lowest 1 


To select fixed priority, set the PRIORITY MODE bit (bit 14) of channel 0’s 
DMA-channel control register to 1. 
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12.3.5.2 Rotating Priority Scheme 


In a rotating priority scheme, the last channel serviced becomes the lowest 
priority channel. The other channel sequentially rotates through the priority list 
with the lowest channel next to the last-serviced channel becoming the highest 
priority on the following request. The priority rotates every time the channel 
most recently granted priority completes its access. At system reset, the 
channels are ordered from highest to lowest priority (0, 1). 


To select this scheme, set the PRIORITY MODE bit (bit 14) of channel 0’s DMA 
control register to 0. 


12.3.6 CPU and DMA Controller Arbitration 
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The DMA controller transfers data on its own internal buses. Arbitration is neces- 
sary only when a resource conflict exists between the DMA controller and the 
CPU. The arbitration causes no delay. When there is no conflict, the CPU and 
DMA controller accesses proceed in parallel. 


All arbitration between the CPU and the DMA controller is on an access basis. 
DMA controller internal memory access starts during H3 (see Section 8.5, 
Clocking Memory Access, for more information). 


When the CPU and DMA controllers request the same resource, priority is 
determined as follows: 


J For the ‘C30 and ‘C31/VC33, the CPU always has higher priority, thus the 
DMA must wait until the CPU frees the resource. 


_) For the ‘C32, the DMA channel’s DMA PRI bits (bits 12 and 13 of the channel 
control register) define the arbitration rules (as shown in Table 12-8). The 
CPU has higher priority than the DMA when DMA PRI = 00s; it has lower 
priority than the DMA when DMA PRI = 119. They rotate priority wnen DMA 
PRI = 019. 
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Table 12-8. TMS320C32 DMA PRI Bits and CPU/DMA Arbitration Rules 


DMA PRI 
(Bits 13-12) 


00 


01 


10 


12.3.7 DMA and Interrupts 


Description 


DMA access is lower priority than the CPU access. If the DMA 
channel and the CPU request the same resource, then the CPU 
has priority. (DMA PRI bits are set to 005 at reset.) 


This setting selects rotating arbitration, which sets priorities between 
the CPU and DMA channel by alternating their accesses, but not 
exactly equally. Priority rotates between CPU and DMA accesses 
when they conflict during consecutive instruction cycles. The first 
time the DMA channel and the CPU request the same resource, 
the CPU has priority. If, in the following instruction cycle, the DMA 
controller and the CPU again request the same resource, the DMA 
has priority. Alternate access continues as long as the CPU and 
DMA requests conflict in consecutive instruction cycles. When 
there is no conflict in a previous instruction cycle, the CPU has 
priority. 


Reserved 


DMA access is higher priority than the CPU access. If the DMA 
channel and the CPU request the same resource, the DMA has 
priority. 


The DMA controller uses interrupts in the following way: 


[J It can send interrupts to the CPU or other DMA channel when a block 
transfer finishes. See the TCINT bit field in the DMA global-control register 
(Figure 12-36, Figure 12-37, or Figure 12-38 on page 12-53). The 
EDINT bit field (C30 and ’C31/VC33) or the EDINTO and EDINT1 bit fields 
(C32) in the interrupt-enable register must be set to allow the CPU to be 
interrupted by the DMA. 


J It can receive interrupts from the external interrupt pins (INT3—0), the 
timers, the serial ports, or other DMA channel. 


This section explains how the DMA receives interrupts. This process is called 


synchronization. 


All of the interrupts that the DMA controller receives are detected by the CPU 
interrupt controller and latched by the CPU in the appropriate interrupt-flag 


register. 
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The DMA and the CPU can respond to the same interrupt if the CPU is not 
involved in any pipeline conflict or in any instruction that halts instruction fetching. 
Refer to section 7.6.2, Interrupt Vector Table and Prioritization, on page 7-29 for 
more details. It is also possible for different DMA channels to respond to the same 
interrupt. If the same interrupt is selected for source and destination synchroniza- 
tion, both read and write cycles are enabled with a single incoming interrupt. 


12.3.7.1_ Interrupts and Synchronization of DMA Channels 


You can use interrupts to synchronize DMA channels. This section describes 
the following four synchronization mechanisms: 


(1 No synchronization (SYNC = 0 0) 


When SYNC = 0 0, no synchronization is performed. The DMA performs 
reads and writes whenever there are no conflicts. All interrupts are ignored 
and are considered to be globally disabled. However, no bits in the DMA 
interrupt-enable register are changed. Figure 12-43 shows the synchro- 
nization mechanism when SYNC = 0 0. 


Figure 12-43. Mechanism for No DMA Synchronization 
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DMA channel performs a read 
DMA channel performs a write 
Go to start 


(J Source synchronization (SYNC = 0 1) 


When SYNC = 0 1, the DMA is synchronized to the source (see 
Figure 12-44). A read is not performed until an interrupt is received by the 
DMA. Then all DMA interrupts are disabled globally. However, no bits in 
the DMA interrupt-enable register are changed. 
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Figure 12-44. Mechanism for DMA Source Synchronization 
Idle until enabled interrupt is received 
Disable DMA interrupts globally 


v 


Clear corresponding IF bit 
DMA channel performs a read 


Enable DMA interrupts globally 


v 


DMA channel performs a write 


Go to start 


(J Destination synchronization (SYNC = 1 0) 


When SYNC = 1 0, the DMA is synchronized to the destination. First, all 

interrupts are ignored until the read is complete. Though the DMA interrupts 

are considered globally disabled, no bits in the DMA interrupt-enable regis- 

ter are changed. A write is not performed until an interrupt is received by the 

DMA, while the read is performed without waiting for the interrupt. 

Figure 12-45 shows the synchronization mechanism when SYNC = 1 0. 
Figure 12-45. Mechanism for DMA Destination Synchronization 


DMA channel performs a read 
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(1 Source and destination synchronization (SYNC = 1 1) 


When SYNC = 1 1, the DMA is synchronized to both the source and 
destination. A read is performed when an interrupt is received. Then, a 
write is performed on the following interrupt. Figure 12-46 shows source 
and destination synchronization when SYNC = 1 1. 


Figure 12-46. Mechanism for DMA Source and Destination Synchronization 
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DMA channel performs a read 


Enable DMA interrupts globally 


Disable DMA interrupts globally 
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Clear corresponding IF bit 
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DMA channel performs a write 


Enable DMA interrupts globally 


Go to start 


12.3.8 DMA Memory Transfer Timing 


The ’C30 and ’C31/VC33 devices provide one DMA channel, while the ’C32 
device provides two DMA channels. The maximum data transfer rate that the 
’C3x DMA sustains is one word every two cycles. In the ’C32, the two DMA 
channels transfer data in a sequential time-slice fashion, rather than simulta- 
neously, because the two channels share one common set of busses. 
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The data transfer rate for a DMA channel (assuming a single-channel access 
with no conflicts between CPU or other DMA channels) is as follows: 


J) On-chip memory and peripheral 


m DMAread: One cycle 
m DMAwrite: One cycle 


[J External memory (STRB, STRBO, STRB1, MSTRB) 


m DMAread: Two cycles (one cycle external read followed by one 
cycle load of internal DMA register) 


m DMAwrite: Two cycles (identical to CPU write) 
Lj) External memory (IOSTRB) 


m DMAread: Three cycles (two-cycle external read followed by one 
cycle load of internal DMA register) 


m DMAwrite: Two cycles (identical to CPU write) 


If the DMA started and is transferring data over either external bus, do not 
modify the bus-control register associated with that bus. If you must modify the 
bus-control register (see Chapter 9 or 10), stop the DMA, make the modifica- 
tion, and then restart the DMA. Failure to do this may produce an unexpected 
zero-wait-state bus access. 


DMA memory transfer timing can be very complicated, especially if bus resource 
conflicts occur. However, some rules help you calculate the transfer timing for 
certain DMA setups. For simplification, the following section focuses on a single- 
channel DMA memory transfer timing with no conflict with the CPU or other 
DMA channels. You can obtain the actual DMA transfer timing by combining the 
calculations for single-channel DMA transfer timing with those for bus resource 
conflict situations. 


12.3.8.1 Single DMA Memory Transfer Timing 


When the DMA memory transfer has no conflict with the CPU or any other 
DMA channels, the number of cycles of a DMA transfer depends on whether 
the source and destination location are designated as on-chip memory, 
peripheral, or external ports. When the external port is used, the DMA transfer 
speed is affected by two factors: the external bus wait state and the read/write 
conflict (for example, if a write is followed by a read, the read takes one extra 
half-cycle. See Figure 12-48 footnote on page 12-72). Figure 12-47 through 
Figure 12-49 show the number of cycles a DMA transfer requires from different 
sources to different destinations. Entries in the table represent the number of 
cycles required to do the T transfers, assuming that there are no pipeline conflicts. 
A timing diagram for the DMA transfers accompanies each figure. 
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12.3.9 DMA Initialization/Reconfiguration 


You can control the DMA through memory-mapped registers located on the 
dedicated peripheral bus. Following is the general procedure for initializing 
and/or reconfiguring the DMA: 


1) Halt the DMA by clearing the START bits of the DMA global-control register. 
You can do this by writing a 0 to the DMA global-control register. The DMA 
is halted on RESET. 


2) Configure the DMA through the DMA global-control register (with 
START = 00), as well as the DMA source, destination, and transfer- 
counter registers, if necessary. Refer to Section 12.3.11 on page 12-76 
for more information. 


3) Start the DMA by setting the START bits of the DMA global-control register 
as necessary. 


12.3.10 Hints for DMA Programming 


The following hints help you to improve your DMA programming and avoid un- 
expected results: 


_) Reset the DMA register before starting it. This clears any previously 
latched interrupt that may no longer exist. 


(J Inthe event of aCPU-DMA access conflict, the CPU always prevails. Care- 
fully allocate the different sections of the program in memory for faster 
execution. If a CPU program access conflicts with a DMA access, enabling 
the cache helps if the program is located in external memory. DMA on-chip 
access happens during the H3 phase. 


SSE Sh 
Note: Expansion and Peripheral Buses 


The expansion and peripheral buses cannot be accessed simultaneously 

because they are multiplexed into a common port (see Figure 2-1 on 

page 2-3). This might increase CPU-DMA access conflicts. 

| ee | 

(4 Ensure that each interrupt is received when you use interrupt synchroniza- 
tion; otherwise, the DMA will never complete the block transfer. 


_j Use read/write synchronization when reading from or writing to serial ports 
to guarantee data validity. 


The following are indications that the DMA has finished a set of transfers: 


(J The DINT bit in the IF register is set to 1 (interrupt polling). This requires 
that you set the TCINT bit in the DMA control register first. This interrupt- 
polling method does not cause any additional CPU-DMA access conflict. 
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(1 The transfer counter has a zero value. However, the transfer counter is 
decremented after the DMA read operation finishes (not after the write 
operation). Nevertheless, a transfer counter with a 0 value can be used as 
an indication of a transfer completion. 


(Lj The STAT bits in the DMA channel-control register are set to 002. You can 
poll the DMA channel-control register for this value. However, because the 
DMA registers are memory-mapped into the peripheral bus address 
space, this option can cause further CPU/DMA access conflicts. 


12.3.11 DMA Programming Examples 
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Example 12-8, Example 12-9, and Example 12-10 illustrate initialization 
procedures for the DMA. 


When linking the examples, you should allocate section memory addresses 
carefully to avoid CPU-DMA conflict. In the C3x, the CPU always prevails in 
cases of conflict. In the event of a CPU program/DMA data conflict, cache 
enabling helps if the .text section is in external memory. For example, when 
linking the code in Example 12-8, Example 12-9, and Example 12-10, the 
.text section can be allocated into RAMO, .data into RAM1, and .bss into 
RAM1, where RAMO corresponds to on-chip RAM block 0 and RAM1 corre- 
sponds to on-chip RAM block 1. 


In Example 12-8, the DMA initializes a 128-element array to 0. The DMA 
sends an interrupt to the CPU after the transfer is completed. This program as- 
sumes previous initialization of the CPU interrupt-vector table (specifically the 
DMA-to-CPU interrupt). The program initializes the ST and IE registers for 
interrupt processing. 


Example 12-8. Array Initialization With DMA 
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* TITLE: 


* 


DMA 
RESET 
CONTROL 
SOURCE 
DESTIN 
COUNT 
ZERO 


START 


ARRAY INITIALIZATION WITH 


-GLOBAL START 
-DATA 
. WORD 
. WORD 
- WORD 
. WORD 
. WORD 
- WORD 
. FLOAT 0.0 ; 
-BSS _ARRAY,128 ; 
. TEXT 


808000H ; 
0C40H ; 
0C43H ; 
ZERO ; 
_ ARRAY ; 
128 ; 


DMA ; 
@DMA, ARO ; 
@RESET, RO ; 
RO, *ARO 

I @SOURCE, RO ; 
I RO, *+ARO (4) 

I @DESTIN,RO ; 
RO, *+ARO (6) 

@COUNT, RO ; 
RO, *+ARO (8) 

400H, IE ; 
2000H, ST ; 
I @CONTROL, RO ; 
I RO, *ARO , 


DMA 

DMA GLOBAL 
DMA GLOBAL 
DMA 

DATA SOURC 
DATA 


NUMBER OF 
ARRAY INIT 


LOAD DATA 


POINT TO DMA GLOBAL CONTROL REGISTER 


RESET DMA 


INITIALIZE 


INITIALIZE 


-CONTROL REG ADDRESS 


-CONTROL REG RESET VALUE 
GLOBAL-CONTROL REG INITIALIZATION 


E ADDRESS 


DESTINATION ADDRESS 


WORDS TO TRANSFER 
IALIZATION VALUE 0.0 = 


PAGE POINTER 


INITIALIZE 


ENABLE INT 
ENABLE CPU 
INITIALIZE 
START DMA 


DMA 


ERRUPT FROM DMA TO CPU 
INTERRUPTS GLOBALLY 

DMA 
TRANSFER 


0x80000000 
DATA ARRAY LOCATED IN .BSS SECTION 


DMA SOURCE-ADDRESS REGISTER 


DMA DESTINATION-ADDRESS REGISTER 


TRANSFER COUNTER REGISTER 


GLOBAL CONTROL REGISTER 


Example 12-9 sets up the DMA to transfer data (128 words) from the serial 
port 0 input register to an array buffer with serial port receive interrupt (RINTO). 
The DMA sends an interrupt to the CPU when the data transfer completes. 


Serial port 0 is initialized to receive 32-bit data words with an internally generated 
receive-bit clock and a bit-transfer rate of 8H1 cycles/bit. 


This program assumes previous initialization of the CPU interrupt vector table 
(specifically the DMA-to-CPU interrupt). The serial port interrupt directly affects 
only the DMA; no CPU serial-port interrupt-vector setting is required. 
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Example 12-9. DMA Transfer With Serial-Port Receive Interrupt 


* TITLE DMA TRANSFER WI’ 


SPORT 
SGCCTRL 
SRCTRL 
STCTRL 
STPERIOD 
SPRESET 

RESET 


START 


* DMA INITI 


* SERIAL 


-GLOBAL START 


-DATA 
. WORD 
. WORD 
. WORD 
. WORD 
. WORD 
. WORD 
. WORD 


-BSS 


. WOR: 
- WOR: 
- WOR: 
. WOR: 
. WOR: 
. WOR: 
. WOR: 
. TEXT 


LDP 


LD] 
STI 


LDI 
STI 


PORT IN 


oe Ro MS. Me 


808000H 
0D43H 
80804CH 

_ ARRAY 
128 
00200400H 
0D40H 


_ARRAY, 128 


808040H 
0A300080H 
111H 

3COH 
00020000H 
01300080H 
OH 


DMA 


ALIZATION 


@DMA, ARO 


@SPOR 
@RESE 


@RESE 
RO, *A’ 
@SPRE 
RO, *A’ 
@SOUR 


@DEST 


@COUN 


2000H 
@CONT: 
RO,*A 


T,AR1 
T,RO 


RO, *+AR1 (4) 


T1,RO 
RO 
SET, RO 
R1 
CE, RO 


RO, *+ARO (4) 


IN,RO 


RO, *+ARO (6) 


T,RO 


RO, *+ARO (8) 
@IEVA 


L, IE 
,ST 
ROL, RO 
RO 


[TIALI ZATION 


RL, RO 


RO, *+AR1 (3) 


RIOD, RO 


RO, *+AR1 (6) 


RL, RO 


RO, *+AR1 (4) 
@SGCCTRL, RO 


RO,*A 


R1 


[TH SERIAL PORT RECEIVE INTERRUPT 


DMA GLOBAL-CONTROL REG ADDRESS 

; DMA GLOBAL-CONTROL REG INITIALIZATION 
DATA SOURCE-ADDRESS: SERIAL PORT INPUT REG 
DATA DESTINATION ADDRESS 
NUMBER OF WORDS TO TRANSFER 
IE REGISTER VALUE 
DMA RESET 


DATA ARRAY LOCATED IN .BSS SECTION 
THE UNDERSCORE USED IS JUST TO MAKE IT 
ACCESSIBLE FROM C (OPTIONAL) 


SERIAL-PORT GLOBAL-CONTROL REG ADDRESS 
SERIAL-PORT GLOBAL-CONTROL REG INITIALIZATION 
SERIAL-PORT RX PORT CONTROL REG INITIALIZATION 


SERIAL- POR IMER-CONTROL REG INITIALIZATION 
SERIAL- POR [MER PERIOD 

; SERIAL-PORT RESET 
SERIAL- POR MER RESET 


LOAD DATA PAGE POINTER 


POINT TO DMA GLOBAL CONTROL REGISTER 


RESET SPORT TIMER 


INITIALIZE DMA SOURCE-ADDRESS REGISTER 


INITIALIZE DMA DESTINATION-ADDRESS REGISTER 


INITIALIZE DMA TRANSFER COUNTER REGISTER 


ENABLE INTERRUPTS 

ENABLE CPU INTERRUPTS GLOBALLY 
[INITIALIZE DMA GLOBAL CONTROL REGISTER 
START DMA TRANSFER 


SERIAL-PORT RECEIVE CONTROL REG INITIALIZATION 


SERIAL- POR IMER-PERIOD INITIALIZATION 


SERIAL- POR IMER CONTROL REG INITIALIZATION 


SERIAL-PORT GLOBAL CONTROL REG INITIALIZATION 
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Example 12-10 sets up the DMA to transfer data (128 words) from an array 
buffer to the serial port 0 output register with serial port transmit interrupt XINTO. 
The DMA sends an interrupt to the CPU when the data transfer completes. 


Serial port 0 is initialized to transmit 32-bit data words with an internally gener- 
ated frame sync and a bit-transfer rate of 8(H1) cycles/bit. The receive-bit 
clock is internally generated and equal in frequency to one-half of the ’C3x H1 
frequency. 


This program assumes previous initialization of the CPU interrupt-vector table 
(specifically the DMA-to-CPU interrupt). The serial-port interrupt directly affects 
only the DMA; no CPU serial-port interrupt-vector setting is required. 


a | 


Note: Serial-Port Transmit Synchronization 


The DMA uses serial-port transmit interrupt XINTO to synchronize transfers. 
Because the XINTO is generated when the transmit buffer has written the last 
bit of data to the shifter, an initial CPU write to the serial port is required to 
trigger XINTO to enable the first DMA transfer. 


DMA Transfer With Serial-Port Transmit Interrupt 


* TITLE: DMA TRANSFER WITH SERIAL 
-GLOBAL START 


* 


DMA 
CONTROL 
SOURCE 
DESTIN 
COUNT 
IEVAL 


RESET1 
SPORT 
SGCCTRL 
SXCTRL 
STCTRL 
STPERIOD 
SPRESET 
RESET 


START 


.-DATA 
. WORD 


PORT TRANSMIT INTERRUPT 

808000H ; DMA GLOBAL-CONTROL REG ADDRESS 
OE13H ; DMA GLOBAL-CONTROL REG INITIALIZATION 
(_ ARRAY+1) ; DATA SOURCE ADDRESS 
80804CH ; DATA DESTIN ADDRESS: SERIAL-PORT OUTPUT REG 
127 ; NUMBER OF WORDS TO TRANSFER =(MSG LENGHT-1) 
00100400H ; IE REGISTER VALUE 
_ARRAY,128 ; DATA ARRAY LOCATED IN .BSS SECTION 

; THE UNDERSCORE USED IS JUST TO MAKE IT 

; ACCESSIBLE FROM C (OPTIONAL) 
OE10H ; DMA RESET 
808040H ; SERIAL-PORT GLOBAL-CONTROL REG ADDRESS 
048C0044H ; SERIAL-PORT GLOBAL-CONTROL REG INITIALIZATION 
111H ; SERIAL-PORT TX PORT CONTROL REG INITIALIZATION 
OOFH ; SERIAL-PORT TIMER CONTROL REG INITIALIZATION 
00000002H ; SERIAL-PORT TIMER PERIOD 
00880044H ; SERIAL-PORT RESET 
OH ; SERIAL-PORT TIMER RESET 

; LOAD DATA PAGE POINTER 
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Example 12-10. DMA Transfer With Serial-Port Transmit Interrupt (Continued) 


* DMA INITIALIZATION 


LDI @DMA, ARO 
LDI @SPORT, AR1 
LDI @RESET, RO 
STI RO, *+AR1 (4) 
STI RO, *ARO 

STI RO, *AR1 
LDI @SOURCE, RO 
STI RO, *+ARO (4) 
LDI @DESTIN, RO 
STI RO, *+ARO (6) 
LDI @COUNT, RO 
STI RO, *+ARO (8) 
OR @IEVAL,IE 
OR 2000H,ST 
LDI @CONTROL, RO 


; POINT TO DMA GLOBAL CONTROL REGISTER 


; RESET SPORT TIMER 

; RESET DMA 

; RESET SPORT 

; INITIALIZE DMA SOURCE-ADDRESS REGISTER 


; INITIALIZE DMA DESTINATION-ADDRESS REGISTER 


; INITIALIZE DMA TRANSFER COUNTER REGISTER 


; ENABLE INTERRUPT FROM DMA TO CPU 
; ENABLE CPU INTERRUPTS GLOBALLY 
; INITIALIZE DMA GLOBAL CONTROL REGISTER 


STI RO, *ARO 


* SERIAL PORT INITIALIZATION 


LDI @SXCTRL, RO 
STI RO, *+AR1 (2) 


LDI @STPERIOD, RO ; SERIAL-PORT TIMER-PERIOD INITIALIZATION 


STI RO, *+AR1 (6) 
LDI @STCTRL, RO 

STI RO, *+AR1 (4) 
LDI @SGCCTRL, RO 
STI RO, *AR1 


* CPU WRITES THE FIRST WORD (TRIGGERING EVENT ---> XINT IS GENERATED) 


zr 


LDI @SOURCE, ARO 
LDI *-ARO (1) ,RO 
STI RO, *+AR1 (8) 
BU $ 
. END 


; START DMA TRANSFER 


; SERIAL-PORT TX CONTROL REG INITIALIZATION 


; SERIAL-PORT TIMER-CONTROL REG INITIALIZATION 


; SERIAL-PORT GLOBAL-CONTROL REG INITIALIZATION 
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Other examples are as follows: 


L 


Transfer a 256-word block of data from off-chip memory to on-chip 
memory and generate an interrupt on completion. Maintain the memory 
order. 


DMA source address: 800000h 
DMA destination address: 809800h 
DMA transfer counter: 00000100h 
DMA global control: 00000C53h 


CPU/DMA interrupt enable (IE): 00000400h 


DMA Controller 


) Transfer a 128-word block of data from on-chip memory to off-chip 
memory and generate an interrupt on completion. Invert the memory or- 
der; the highest addressed member of the block is to become the lowest 
addressed member. 


DMA source address: 809800h 
DMA destination address: 800000h 
DMA transfer counter: 00000080h 
DMA global control: 00000C93h 


CPU/DMA interrupt-enable (IE): 00000400h 


J) Transfer a 200-word block of data from the serial-port 0 receive register 
to on-chip memory and generate an interrupt on completion. Synchronize 
the transfer with the serial-port 0 receive interrupt. 


DMA source address: 80804Ch 
DMA destination address: 809C00h 
DMA transfer counter: 000000C8h 
DMA global control: 00000D43h 


CPU/DMA interrupt-enable (IE): 00200400h 


(J Transfer a 200-word block of data from off-chip memory to the serial-port 0 
transmit register and generate an interrupt on completion. Synchronize with 
the serial-port 0 transmit interrupt. 


DMA source address: 809C00h 
DMA destination address: 808048h 
DMA transfer counter: 000000C8h 
DMA global control: 00000E13h 


CPU/DMA interrupt-enable (IE): 00400400h 


(J Transfer data continuously between the serial-port 0 receive register and 
the serial-port 0 transmit register to create a digital loop back. Synchronize 
with the serial-port 0 receive and transmit interrupts. 


DMA source address: 80804Ch 
DMA destination address: 808048h 
DMA transfer counter: 00000000h 
DMA global control: 00000303h 


CPU/DMA interrupt-enable (IE): 00300000h 
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Assembly Language Instructions 


The ’C3x assembly language instruction set supports numeric-intensive, signal- 
processing, and general-purpose applications. (The addressing modes used with 
the instructions are described in Chapter 5.) 


The ’C8x instruction set can also use one of 20 condition codes with any of the 
10 conditional instructions, such as LDF cond. This chapter defines the condition 
codes and flags. 


The assembler allows optional syntax forms to simplify the assembly language 
for special-case instructions. These optional forms are listed and explained. 


Each of the individual instructions is described and listed in alphabetical order 
(see subsection 13.6.2, Optional Assembler Syntax, on page 13-34). Example 
instructions demonstrate the special format and explain its content. 


This chapter discusses these topics: 


Topic Page 
13:1 \Instruction Set... sacs seme eecen es ncd ences wets demons nee 13-2 
13:2 Instruction’ Set: SUMAN y) eererrers serctelsl crete ste stele eres ateretete re (el -tereta= 13-10 
13.3 Parallel Instruction Set Summary ............0:0eeeeeeee eee eee 13-17 
13.4 Group Addressing Mode Instruction Encoding ................ 13-20 
13:5) Gondition’'Godes|andiFlags mari)-i octets ys sseeteeret ete etete eit 13-28 
13:6) Individual IMStruCtlOmsye ie ec cre ict yereteielo tte eielejoretsye elelejavayn a irvelelerels 13-32 


mnemonic = Running Title—Attribute Reference 


13.1 Instruction Set 


The ’C8x instruction set is well suited to digital signal processing and other 
numeric-intensive applications. All instructions are a single machine word 
long, and most instructions require one cycle to execute. In addition to multiply 
and accumulate instructions, the C3x possesses a full complement of general- 
purpose instructions. 


The instruction set contains 113 instructions organized into the following func- 
tional groups: 


Load and store 

2-operand arithmetic/logical 
3-operand arithmetic/logical 
Program control 

Interlocked operations 
Parallel operations 


i ae YO 


Each of these groups is discussed in the following subsections. 


13.1.1. Load and Store Instructions 


Table 13-1. 


Instruction 


LDE 
LDF 


LDFcond 


LDI 
LDIcond 
LDM 


LDP 
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The ’C3x supports 13 load and store instructions (see Table 13-1). These 
instructions can: 


_j Load a word from memory into a register 
_j Store a word from a register into memory 
_j Manipulate data on the system stack 


Two of these instructions can load data conditionally. This is useful for locating 
the maximum or minimum value in a data set. See Section 13.5 on page 13-28 
for detailed information on condition codes. 


Load and Store Instructions 


Description Instruction Description 

Load floating-point exponent POP Pop integer from stack 

Load floating-point value POPF Pop floating-point value from stack 
Load floating-point value PUSH Push integer on stack 
conditionally 

Load integer PUSHF Push floating-point value on stack 
Load integer conditionally STF Store floating-point value 

Load floating-point mantissa STI Store integer 


Load data page pointer 
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13.1.2 2-Operand Instructions 


Table 13-2. 


Instruction 
ABSF 


ABSI 
ADDct 
ADDFt 
ADDIt 
ANDt 
ANDNt 


ASHt 
CMPFt 
CMPIt 
FIX 


FLOAT 


LSHT 


MPYFt 


MPYIt 
NEGB 
NEGF 
NEGI 


The ’C3x supports 35 2-operand arithmetic and logical instructions. The two 
operands are the source and destination. The source operand can be a memory 
word, a register, or a part of the instruction word. The destination operand is 
always a register. 


As shown in Table 13-2, these instructions provide integer, floating-point or 
logical operations, and multiprecision arithmetic. 


2-Operand Instructions 


Description Instruction Description 

Absolute value of a floating- NORM Normalize floating-point value 

point number 

Absolute value of an integer NOT Bitwise-logical complement 

Add integers with carry ORT Bitwise-logical OR 

Add floating-point values RND Round floating-point value 

Add integers ROL Rotate left 

Bitwise-logical AND ROLC Rotate left through carry 

Bitwise-logical AND with ROR Rotate right 

complement 

Arithmetic shift RORC Rotate right through carry 

Compare floating-point values SUBBT Subtract integers with borrow 

Compare integers SUBC Subtract integers conditionally 

Convert floating-point value to SUBFT Subtract floating-point values 

integer 

Convert integer to floating-point SUBIt Subtract integer 

value 

Logical shift SUBRB Subtract reverse integer with 
borrow 

Multiply floating-point values SUBRF Subtract reverse floating-point 
value 

Multiply integers SUBRI Subtract reverse integer 

Negate integer with borrow TSTBt Test bit fields 

Negate floating-point value XORt Bitwise-exclusive OR 


Negate integer 


Tt 2- and 3-operand versions 
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13.1.3 3-Operand Instructions 


Whereas 2-operand instructions have a single source operand (or shift count) 
and a destination operand, 3-operand instructions can have two source operands 
(or one source operand and a count operand) and a destination operand. A source 
operand can be a memory word or a register. The destination of a 3-operand 
instruction is always a register. 


Table 13-3 lists the instructions that have 3-operand versions. You can omit 
the 3in the mnemonic from 3-operand instructions (See subsection 13.6.2 on 


page 13-34). 


Table 13-3. 3-Operand Instructions 


Instruction 


ADDC3 
ADDF3 
ADDI3 
AND3 
ANDN3 
ASH3 
CMPF3 
CMPI3 
LSH3 


Description Instruction Description 

Add with carry MPYF3 Multiply floating-point values 
Add floating-point values MPYI3 Multiply integers 

Add integers OR3 Bitwise-logical OR 
Bitwise-logical AND SUBB3 Subtract integers with borrow 
Bitwise-logical AND with complement SUBF3 Subtract floating-point values 
Arithmetic shift SUBI3 Subtract integers 

Compare floating-point values TSTB3 Test bit fields 

Compare integers XOR3 Bitwise-exclusive OR 


Logical shift 


13.1.4 Program-Control Instructions 
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The program-control instruction group consists of all of those instructions (17) 
that affect program flow. The repeat mode allows repetition of a block of code 
(RPTB) or of a single line of code (RPTS). Both standard and delayed 
(single-cycle) branching are supported. Several program-control instructions 
can perform conditional operations. (See Section 13.5 on page 13-28 
for detailed information on condition codes.) Table 13-4 lists the program- 
control instructions. 


Instruction Set Mnemonic 


Table 13-4. Program-Control Instructions 


Instruction Description Instruction Description 
Bcond Branch conditionally (standard) IDLE Idle until interrupt 
BconaD Branch conditionally (delayed) NOP No operation 
BR Branch unconditionally (standard) RETIcond Return from interrupt conditionally 
BRD Branch unconditionally (delayed) RETScond Return from subroutine conditionally 
CALL Call subroutine RPTB Repeat block of instructions 
CALLcond Call subroutine conditionally RPTS Repeat single instruction 
DBcond Decrement and branch conditionally || SWI Software interrupt 
(standard) 
DBconaD Decrement and branch conditionally || TRAPcond Trap conditionally 
(delayed) 
IACK Interrupt acknowledge 


13.1.5 Low-Power Control Instructions 


The low-power control instruction group consists of three instructions that affect 
the low-power modes. The low-power idle (IDLE2) instruction allows extremely 
low-power mode. The divide-clock-by-16 (LOPOWER) instruction reduces the 
rate of the input clock frequency. The _ restore-clock-to-regular-speed 
(MAXSPEED) instruction causes the resumption of full-speed operation. 
Table 13-5 lists the low-power control instructions. 


Table 13-5. Low-Power Control Instructions 


Instruction Description Instruction Description 


IDLE2 Low-power idle MAXSPEED Restore clock to regular speed 


LOPOWER Divide clock by 16 


13.1.6 Interlocked-Operations Instructions 


The five interlocked-operations instructions (Table 13-6) support multi- 
processor communication and the use of external signals to allow for powerful 
synchronization mechanisms. They also ensure the integrity of the communi- 
cation and result in a high-speed operation. Refer to Chapter 7 for examples 
of the use of interlocked instructions. 
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Instruction 


LDFI 
LDII 
SIGI 


Instruction Set 


Interlocked-Operations Instructions 


Description Instruction Description 
Load floating-point value, interlocked |} STFI Store floating-point value, interlocked 
Load integer, interlocked STII Store integer, interlocked 


Signal, interlocked 


13.1.7 Parallel-Operations Instructions 


The 13 parallel-operations instructions make a high degree of parallelism 
possible. Some of the ’C3x instructions can occur in pairs that are executed 
in parallel. These instructions offer the following features: 


_j Parallel loading of registers 
_j Parallel arithmetic operations 
1 Arithmetic/logical instructions used in parallel with a store instruction 


Each instruction in a pair is entered as a separate source statement. The second 
instruction in the pair must be preceded by two vertical bars (||). Table 13-7 lists 
the valid instruction pairs. 


Table 13-7. Parallel Instructions 
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(a) Parallel arithmetic with store instructions 


Mnemonic Description 
ABSF Absolute value of a floating-point number and store floating- 
| STF point value 
ABSI Absolute value of an integer and store integer 
| STI 
ADDF3 Add floating-point values and store floating-point value 
| STF 
ADDI3 Add integers and store integer 
| STI 
AND3 Bitwise-logical AND and store integer 
| STI 
ASH3 Arithmetic shift and store integer 
| STI 
FIX Convert floating-point to integer and store integer 
| STI 
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Table 13-7. Parallel Instructions (Continued) 


(a) Parallel arithmetic with store instructions (Continued) 


Mnemonic 


FLOAT 
|| STF 


LDF 
|| STF 


LDI 
|| STI 


LSH3 
|| STI 


MPYF3 
|| STF 


MPYI3 
|| STI 


NEGF 
|| STF 


NEGI 
|| STI 


NOT 
|| STI 


OR3 
|| STI 


STF 
|| STF 


STI 
|| STI 


SUBF3 
|| STF 


SUBI3 
|| STI 


XOR3 
|| STI 


Description 


Convert integer to floating-point value and store floating- 


point value 


Load floating-point value and store floating-point value 


Load integer and store integer 


Logical shift and store integer 


Multiply floating-point values and store floating-point value 


Multiply integer and store integer 


Negate floating-point value and store floating-point value 


Negate integer and store integer 


Complement value and store integer 


Bitwise-logical OR value and store integer 


Store floating-point values 


Store integers 


Subtract floating-point value and store floating-point value 


Subtract integer and store integer 


Bitwise-exclusive OR values and store integer 
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Table 13-7. Parallel Instructions (Continued) 


(b) Parallel load instructions 


Mnemonic Description 

LDF Load floating-point value 
|| LDF 

LDI Load integer 
|| LDI 


(c) Parallel multiply and add/subtract instructions 


Mnemonic Description 
MPYF3 Multiply and add floating-point value 
| ADDF3 
MPYF3 Multiply and subtract floating-point value 
| SUBF3 
MPYI3 Multiply and add integer 
| ADDI3 
MPYI3 Multiply and subtract integer 
| SUBI3 


These parallel instructions have been enhanced on the following devices: 


_j ’C31 silicon revision 6.0 or greater 
J ’C32 silicon revision 2.0 or greater 


These devices support greater combinations of operands by also allowing the 
use of any CPU register whenever an indirect operand is required. The particular 
instruction description details the operand combination. 


To support these new modes, you need to invoke the TMS320 floating-point 
code generation tools (version 5.0 or later) with the following switches: 


Li C Compiler 
m ‘C31: CL30 -v31l -gsrev6é 
m ‘C32: CL30 -v32 -gsrev2 


J Assembler 


m@ ‘C31: asm30 -v31l -msrev6é 
mB ‘C32: asm30 -v32 -msrev2 
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13.1.8 Illegal Instructions 


The ’C3x has no illegal instruction-detection mechanism. Fetching an illegal 
(undefined) opcode can cause the execution of an undefined operation. Proper 
use of the Tl TMS320 floating-point software tools will not generate an illegal 
opcode. Only the following conditions can cause the generation of an illegal 
opcode: 


_j Misuse of the tools 
_} An error in the ROM code 
_j Defective RAM 
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13.2 Instruction Set Summary 


Table 13-8 lists the ’C3x instruction set in alphabetical order. Each table entry 


provides the instruction mnemonic, description, and operation. 


Table 13-8. Instruction Set Summary 
Mnemonic Description Operation 
ABSF Absolute value of a floating-point number |src| > Rn 
ABSI Absolute value of an integer |src| > Dreg 
ADDC Add integers with carry src + Dreg + C > Dreg 
ADDC3 Add integers with carry (3-operand) srci + src2 + C > Dreg 
ADDF Add floating-point values src+Rn—- Rn 
ADDF3 Add floating-point values (3-operand) srcil + src2 — Rn 
ADDI Add integers src + Dreg — Dreg 
ADDI3 Add integers (3 operand) srcl + src2 + — Dreg 
AND Bitwise-logical AND Dreg AND src —> Dreg 
AND3 Bitwise-logical AND (3-operand) srci AND src2 — Dreg 
ANDN Bitwise-logical AND with complement Dreg AND src > Dreg 
ANDN3 Bitwise-logical ANDN (3-operand) sre1 AND src2 > Dreg 
ASH Arithmetic shift If count 2 0: 
(Shifted Dreg left by count) > Dreg 
Else: 
(Shifted Dreg right by |count|) > Dreg 
ASH3 Arithmetic shift (8-operand) If count = 0: 


(Shifted src left by count) > Dreg 
Else: 


(Shifted src right by |count|) + Dreg 


Legend: ARn auxiliary register n (AR7—ARO) RE repeat interrupt register 
Cc carry bit RM repeat mode bit 
Csrc conditional-branch addressing modes Rn register address (R7—RO) 
count shift value (general addressing modes) RS repeat start register 
cond condition code SP stack pointer 
Daddr destination memory address Sreg register address (any register) 
Dreg register address (any register) ST status register 
GIE global interrupt enable register src general addressing modes 
N any trap vector 0-27 srci 3-operand addressing modes 
PC program counter src2 3-operand addressing modes 
RC repeat counter register TOS top of stack 
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Table 13-8. Instruction Set Summary (Continued) 
Mnemonic Description 
Bcond Branch conditionally (standard) 
BconaD Branch conditionally (delayed) 
BR Branch unconditionally (standard) 
BRD Branch unconditionally (delayed) 
CALL Call subroutine 
CALLcond Call subroutine conditionally 
CMPF Compare floating-point values 
CMPF3 Compare floating-point values 
(3-operand) 
CMPI Compare integers 
CMPI3 Compare integers (3-operand) 
Legend: ARn auxiliary register n (AR7—ARO) 
Cc carry bit 
Csrc conditional-branch addressing modes 
count shift value (general addressing modes) 
cond condition code 
Daddr destination memory address 
Dreg register address (any register) 
GIE global interrupt enable register 
N any trap vector 0-27 
PC program counter 
RC repeat counter register 
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Operation 

If cond = true: 

If Csrcis a register, Csrc > PC 

If Csrcis a value, Csrc + PC — PC 
Else, PC + 1 > PC 


If cond = true: 

If Csrcis a register, Csrc > PC 

lf Csrcis a value, Csrc + PC + 3 — PC 
Else, PC + 1 > PC 


Value > PC 
Value > PC 


PC +1— TOS 
Value > PC 


If cond = true: 

PC +1— TOS 

If Csrcis a register, Csrc > PC 

If Csrcis a value, Csrc + PPC > PC 
Else, PC + 1 > PC 


Set flags on Rn - src 


Set flags on src1 — src2 


Set flags on Dreg — src 


Set flags on src1 — src2 


RE repeat interrupt register 

RM repeat mode bit 

Rn register address (R7—RO) 

RS repeat start register 

SP stack pointer 

Sreg register address (any register) 
ST status register 

src general addressing modes 
src 3-operand addressing modes 


src2 
TOS 


3-operand addressing modes 
top of stack 
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Decrement and branch conditionally 


RE repeat interrupt register 
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Table 13-8. Instruction Set Summary (Continued) 
Mnemonic Description 
DBcond 
(standard) 
DBconaD Decrement and branch conditionally 
(delayed) 
FIX Convert floating-point value to integer 
FLOAT Convert integer to floating-point value 
IACK Interrupt acknowledge 
IDLE Idle until interrupt 
IDLE2 Low-power idle 
LDE Load floating-point exponent 
LDF Load floating-point value 
LDFcond Load floating-point value conditionally 
LDFI Load floating-point value, interlocked 
LDI Load integer 
Legend: ARn auxiliary register n (AR7—ARO) 
Cc carry bit 
Csrc conditional-branch addressing modes 
count shift value (general addressing modes) 
cond condition code 
Daddr destination memory address 
Dreg register address (any register) 
GIE global interrupt enable register 
N any trap vector 0-27 
PC program counter 
RC repeat counter register 
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Operation 

ARn-1—ARn 

If cond = true and ARn 2 0: 

If Csrcis a register, Csrc > PC 

If Csrcis a value, Csrc+ PC +1— PC 
Else, PC + 1 > PC 


ARn-1-— ARn 

If cond = true and ARn 2 0: 

If Csrcis a register, Csrc > PC 

If Csrcis a value, Csrc+ PC +3—PC 
Else, PC + 1 — PC 

Fix (src) — Dreg 

Float(src) — Rn 


Dummy read of src 
IACK toggled low, then high 


PC +1—PC 


Idle until next interrupt 


Idle until next interrupt stopping internal clocks 


src(exponent) > Rn(exponent) 
src Rn 


If cond = true, src > Rn 


Else, Rnis not changed 


Signal interlocked operation src > Rn 


src — Dreg 

RM repeat mode bit 

Rn register address (R7—RO) 

RS repeat start register 

SP stack pointer 

Sreg register address (any register) 
ST status register 

src general addressing modes 
srci 3-operand addressing modes 


src2 
TOS 


3-operand addressing modes 
top of stack 
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Operation 
If cond = true, src + Dreg 


Else, Dreg is not changed 

Signal interlocked operation src + Dreg 
src (mantissa) — Rn (mantissa) 

src — data page pointer 

H1/16 > H1 


If count = 0: 
(Dreg left-shifted by count) > Dreg 
Else: 


(Dreg right-shifted by |count|) — Dreg 


If count = 0: 
(src left-shifted by count) > Dreg 
Else: 


(src right-shifted by |count|) - Dreg 
H1/16 > H1 

srcx Rn— Rn 

srci1 X src2 + Rn 

src xX Dreg — Dreg 

srci X src2 — Dreg 

0 - src- C > Dreg 

0-src— Rn 


0 - src > Dreg 


RE repeat interrupt register 

RM repeat mode bit 

Rn register address (R7—RO) 

RS repeat start register 

SP stack pointer 

Sreg register address (any register) 
ST status register 

src general addressing modes 
src 3-operand addressing modes 
src2 3-operand addressing modes 


TOS 


Table 13-8. Instruction Set Summary (Continued) 
Mnemonic Description 
LDIcond Load integer conditionally 
LDII Load integer, interlocked 
LDM Load floating-point mantissa 
LDP Load data page pointer 
LOPOWER Divide clock by 16 
LSH Logical shift 
LSH3 Logical shift (8-operand) 
MAXSPEED Restore clock to regular speed 
MPYF Multiply floating-point values 
MPYF3 Multiply floating-point value (3-operand) 
MPYI Multiply integers 
MPYI3 Multiply integers (3-operand) 
NEGB Negate integer with borrow 
NEGF Negate floating-point value 
NEGI Negate integer 
Legend: ARn auxiliary register n (AR7—ARO) 
Cc carry bit 
Csrc conditional-branch addressing modes 
count shift value (general addressing modes) 
cond condition code 
Daddr destination memory address 
Dreg register address (any register) 
GIE global interrupt enable register 
N any trap vector 0-27 
PC program counter 
RC repeat counter register 


top of stack 
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Table 13-8. Instruction Set Summary (Continued) 


Mnemonic 
NOP 


NORM 
NOT 
OR 
OR3 
POP 
POPF 
PUSH 
PUSHF 


RETIcond 


RETScond 


RND 
ROL 
ROLC 
ROR 
RORC 


Legend: 


Description Operation 
No operation Modify ARn if specified 
Normalize floating-point value Normalize (src) > Rn 
Bitwise-logical complement src —> Dreg 
Bitwise-logical OR Dreg OR src > Dreg 
Bitwise-logical OR (3-operand) src1 OR src2 — Dreg 
Pop integer from stack *SP-— — Dreg 
Pop floating-point value from stack *SP-- > Rn 
Push integer on stack Sreg — *++ SP 
Push floating-point value on stack Rn — *++ SP 
Return from interrupt conditionally If cond = true or missing: 
*SP-- > PC 
1 —> ST (GIE) 
Else, continue 
Return from subroutine conditionally If cond = true or missing: 
*SP-- > PC 
Else, continue 
Round floating-point value Round (src) > Rn 
Rotate left Dreg rotated left 1 bit — Dreg 
Rotate left through carry Dreg rotated left 1 bit through carry — Dreg 
Rotate right Dreg rotated right 1 bit  Dreg 
Rotate right through carry Dreg rotated right 1 bit through carry > Dreg 
ARn auxiliary register n (AR7—ARO) RE repeat interrupt register 
Cc carry bit RM repeat mode bit 
Csrc conditional-branch addressing modes Rn register address (R7—RO) 
count shift value (general addressing modes) RS repeat start register 
cond condition code SP stack pointer 
Daddr destination memory address Sreg register address (any register) 
Dreg register address (any register) ST status register 
GIE global interrupt enable register src general addressing modes 
N any trap vector 0-27 srci 3-operand addressing modes 
PC program counter src2 3-operand addressing modes 
RC repeat counter register TOS top of stack 
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Table 13-8. 

Mnemonic 

RPTB 

RPTS 

SIGI 

STF 

STFI 

STI 

STII 

SUBB 

SUBB3 

SUBC 

SUBF 

SUBF3 

Legend: ARn 
Cc 
Csrc 
count 
cond 
Daddr 
Dreg 
GIE 
N 
PC 
RC 


Description 


Repeat block of instructions 


Repeat single instruction 


Signal, interlocked 


Store floating-point value 


Store floating-point value, interlocked 


Store integer 


Store integer, interlocked 


Subtract integers with borrow 
Subtract integers with borrow (3-operand) 


Subtract integers conditionally 


Subtract floating-point values 
Subtract floating-point values (3-operand) 


auxiliary register n (AR7—ARO) 

carry bit 

conditional-branch addressing modes 
shift value (general addressing modes) 
condition code 

destination memory address 

register address (any register) 

global interrupt enable register 

any trap vector 0-27 

program counter 

repeat counter register 


Operation 

src > RE 

1 > ST (RM) 
Next PC > RS 


src + RC 

1 —> ST (RM) 
Next PC — RS 
Next PC — RE 


Signal interlocked operation 
Wait for interlock acknowledge 
Clear interlock 


Rn — Daddr 
Rn — Daddr 


Signal end of interlocked operation 
Sreg — Daddr 
Sreg — Daddr 


Signal end of interlocked operation 
Dreg - src- C > Dreg 
src1 — src2 —- C > Dreg 


If Dreg - src 2 0: 
[(Dreg — src) << 1] OR 1 — Dreg 


Else, Dreg << 1 — Dreg 
Rn-src— Rn 


srci — src2 > Rn 


RE repeat interrupt register 

RM repeat mode bit 

Rn register address (R7—RO) 

RS repeat start register 

SP stack pointer 

Sreg register address (any register) 
ST status register 

src general addressing modes 
src 3-operand addressing modes 
src2 3-operand addressing modes 


TOS top of stack 
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Table 13-8. Instruction Set Summary (Continued) 
Mnemonic Description 
SUBI Subtract integers 
SUBI3 Subtract integers (3-operand) 
SUBRB Subtract reverse integer with borrow 
SUBRF Subtract reverse floating-point value 
SUBRI Subtract reverse integer 
SWI Software interrupt 
TRAP cond Trap conditionally 
TSTB Test bit fields 
TSTB3 Test bit fields (3-operand) 
XOR Bitwise-exclusive OR 
XOR3 Bitwise-exclusive OR (3-operand) 
Legend: ARn auxiliary register n (AR7—ARO) 
Cc carry bit 
Csrc conditional-branch addressing modes 
count shift value (general addressing modes) 
cond condition code 
Daddr destination memory address 
Dreg register address (any register) 
GIE global interrupt enable register 
N any trap vector 0-27 
PC program counter 
RC repeat counter register 
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Operation 


Dreg — src — Dreg 

src — src2 — Dreg 

src — Dreg - C > Dreg 
src- Rn— Rn 

src — Dreg — Dreg 
Perform emulator interrupt sequence 
If cond = true or missing: 
Next PC > * ++ SP 
Trap vector N > PC 

0 > ST (GIE) 

Else, continue 

Dreg AND src 

srci AND src2 

Dreg XOR src > Dreg 
src1 XOR src2 — Dreg 


RE repeat interrupt register 

RM repeat mode bit 

Rn register address (R7—RO) 

RS repeat start register 

SP stack pointer 

Sreg register address (any register) 
ST status register 

src general addressing modes 


srci 3-operand addressing modes 
src2 3-operand addressing modes 
TOS top of stack 
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13.3 Parallel Instruction Set Summary 


Table 13-9 lists the ’C3x instruction set in alphabetical order. Each table 
entry shows the instruction mnemonic, description, and operation. Refer to 
Section 13.1 for a functional listing of the instructions and individual instruc- 
tion descriptions. 


Table 13-9. Parallel Instruction Set Summary 


(a) Parallel arithmetic with store instructions 


Mnemonic Description Operation 
ABSF Absolute value of a floating point |src2| > dst 
| STF || src3 > dst2 
ABSI Absolute value of an integer |src2| — dst 
| STI || src3 > dst2 
ADDF3 Add floating-point value srci1 + src2 > dst 
| STF || src3 > dst2 
ADDI3 Add integer srci1 + src2 > dst 
| STI || src3 > dst2 
AND3 Bitwise-logical AND srci1 AND src2 —> dst 
| STI || src3 > dst2 
ASH3 Arithmetic shift If count 2 0: 
| STI (src2 << count) > dst 
|| src3 > dst2 
Else: 
(src2 >> |count|) > dst 
|| src3 > dst2 
FIX Convert floating-point value to integer Fix (src2) > dst 
| STI || src3 — dst2 
FLOAT Convert integer to floating-point value Float(src2) — dst 
|| STF || src3 > dst2 
Legend: count register addr (R7—RO) op3 register addr (RO or R1) 
dst register addr (R7—RO) op6 register addr (R2 or R3) 
dsi2 indirect addr (disp = 0, 1, IRO, IR1) src register addr (R7—RO) 
op1, op2, op4, and op5 src2 indirect addr (disp = 0, 1, IRO, IR1) 
Any two of these operands must be src3 register addr (R7—RO) 


specified using register addr; the remaining 
two must be specified using indirect. 
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Table 13-9. Parallel Instruction Set Summary (Continued) 


(a) Parallel arithmetic with store instructions (Continued) 


Mnemonic Description 

LDF Load floating-point value 
|| STF 

LDI Load integer 
|| STI 

LSH3 Logical shift 
|| STI 

MPYF3 Multiply floating-point value 
|| STF 

MPYI3 Multiply integer 
|| STI 

NEGF Negate floating-point value 
|| STF 

NEGI Negate integer 
|| STI 

NOT Complement 
|| STI 

OR3 Bitwise-logical OR 
|| STI 

STF Store floating-point value 
|| STF 

STI Store integer 
|| STI 
Legend: count register addr (R7—RO) 
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dst register addr (R7—RO) 

dsi2 indirect addr (disp = 0, 1, IRO, IR1) 

opt, op2, op4, and op5 
Any two of these operands must be 
specified using register addr; the remaining 
two must be specified using indirect. 


op3 
op6 
srci 
src2 
src3 


Operation 
src2 — dst 
|| src3 > dsi2 
src2 — dst 
|| src3 > dsi2 
If count 2 0: 
src2 << count > dst 
|| src3 > dsi2 
Else: 
src2 >> |count| > dst 
src3 — dsi2 
srci x src2 — dst 
src3 — dsi2 
srci x src2 — dst 
src3 — dsi2 
0 - src2 — ast 
src3 — dsi2 
0 - src2 > ast 
src3 — dsi2 
src — dstt 
src3 — dsi2 
src1 OR src2 > dst 
src3 — dsi2 
srci > dst 
src3 — dsi2 


src1 —> dst 


src3 — dsi2 


register addr (RO or R1) 

register addr (R2 or R3) 

register addr (R7—RO) 

indirect addr (disp = 0, 1, IRO, IR1) 
register addr (R7—RO) 


Parallel Instruction Set Summary mnemonic 


Table 13-9. Parallel Instruction Set Summary (Continued) 


(a) Parallel arithmetic with store instructions (Continued) 


Mnemonic Description Operation 
SUBF3 Subtract floating-point value src1 — src2 > dst 
| STF || src3 > dst2 
SUBI3 Subtract integer src1 — src2 > dst 
| STI || src3 > dst2 
XOR3 Bitwise-exclusive OR src1 XOR src2 —> dst 
| STI || src3 > dst2 


(b) Parallel load instructions 


Mnemonic Description Operation 
LDF Load floating-point value src2 — dst 
|| LDF || src4 — dsi2 
LDI Load integer src2 — dst 
|| LDI || src4 — dsi2 


(c) Parallel multiply and add/subtract instructions 


Mnemonic Description Operation 
MPYF3 Multiply and add floating-point value op1 x op2 > op3 
| ADDF3 || op4 + op5 — op6 
MPYF3 Multiply and subtract floating-point value op1 x op2 > op3 
| SUBF3 || op4 — op5 > op6 
MPYI3 Multiply and add integer op1 x op2 > op3 
| ADDI38 || op4 + op5 > op6 
MPYI3 Multiply and subtract integer op1 x op2 > op3 
| SUBI3 || op4 — op5 > op6 
Legend: count register addr (R7—RO) ops register addr (RO or R1) 
dst register addr (R7—RO) op6 register addr (R2 or R3) 
dsi2 indirect addr (disp = 0, 1, IRO, IR1) src register addr (R7—RO) 
op1, op2, op4, and op5 src2 indirect addr (disp = 0, 1, IRO, IR1) 
Any two of these operands must be src3 register addr (R7—RO) 


specified using register addr; the remaining 
two must be specified using indirect. 
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13.4 Group Addressing Mode Instruction Encoding 


The six addressing types (covered in Section 6.1, Addressing Types, on 
page 6-2) form these four groups of addressing modes: 


1 General addressing modes (G) 

_j] 3-operand addressing modes (T) 

Lj] Parallel addressing modes (P) 

_j Conditional-branch addressing modes (B) 


13.4.1 General Addressing Modes 
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Instructions that use the general addressing modes are general-purpose 
instructions, such as ADDI, MPYF, and LSH. Such instructions usually have 
this form: 


dst operation src — dst 


In the syntax, the destination operand is signified by dst and the source operand 
by src; operation defines an operation to be performed on the operands using the 
general addressing modes. Bits 31-29 are 0, indicating general addressing 
mode instructions. Bits 22 and 21 specify the general addressing mode (G) field, 
which defines how bits 15-0 are to be interpreted for addressing the src operand. 


Options for bits 22 and 21 (G field) are as follows: 


G Mode 

00 Register (all CPU registers unless specified otherwise) 
01 Direct 

10 Indirect 

11 Immediate 


If the src and dst fields contain register specifications, the value in these 
fields contains the CPU register addresses as defined by Table 13-10. For 
the general addressing modes, the following values of ARn are valid: 


ARn,0 <n<7 
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Figure 13-1 shows the encoding for the general addressing modes. The notation 
modn indicates the modification field that goes with the ARn field. Refer to 
Table 13-10 on page 13-22 for further information. 


Figure 13-1. Encoding for General Addressing Modes 


Destination Source Operands 


nyo 8]7_5]4 | 
operation 000000000 0 
0 


operation direct 


operation ARn 


operation immediate 
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Table 13-10. Indirect Addressing 


(a) Indirect addressing with displacement 


Mod Field 
00000 
00001 
00010 


00011 


00100 


00101 


00110 


001114 


Syntax 
*+ARn(disp) 
*_ARn(disp) 
*++ARn(disp) 


*--ARn(disp) 


*ARn++(disp) 


*ARn--— (disp) 


*ARn++(disp)% 


*ARn—-—(disp)% 


Operation 
addr = ARn + disp 
addr = ARn - disp 


addr = ARn + disp 
ARn = ARn + disp 


addr = ARn - disp 
ARn = ARn - disp 


addr = ARn 
ARn = ARn + disp 


addr = ARn 
ARn = ARn - disp 


addr = ARn 


ARn = circ(ARn + disp) 


addr = ARn 


ARn = circ(ARn — disp) 


(b) Indirect addressing with index register [RO 


Mod Field 
01000 
01001 
01010 


01011 


01100 


01101 


01110 


01114 


Legend: 
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addr 
ARn 


circ( 
disp 


Syntax 
*+ARn(IRO) 
*_ARn(IRO) 
*++ARn(IRO) 


*__ ARn(IRO) 


*ARn++(IRO) 


*ARn--—(IRO) 


*ARn++(IRO)% 


*ARn-—(IRO)% 


Operation 
addr = ARn + IRO 
addr = ARn — IRO 


addr = ARn + IRO 
ARn = ARn + IRO 


addr = ARn - IRO 
ARn = ARn - IRO 


addr = ARn 
ARn = ARn + |IRO 


addr= ARn 
ARn = ARn - IRO 


addr = ARn 


ARn = circ(ARn + IRO) 


addr = ARn 


ARn = circ(ARn- IRO) 


memory address 


auxiliary registers ARO-AR7 
) address in circular addressing 


displacement 


Description 
With predisplacement add 
With predisplacement subtract 


With predisplacement add and modify 


With predisplacement subtract and modify 


With postdisplacement add and modify 


With postdisplacement subtract and modify 


With postdisplacement add and circular modify 


With postdisplacement subtract and circular modify 


Description 

With preindex (IRO) add 

With preindex (IRO) subtract 

With preindex (IRO) add and modify 


With preindex (IRO) subtract and modify 


With postindex (IRO) add and modify 


With postindex (IRO) subtract and modify 


With postindex (IRO) add and circular modify 


With postindex (IRO) subtract and circular modify 


++ add and modify 

-- subtract and modify 

% where circular addressing is performed 
IRn index register IRO or IR1 
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Table 13-10. Indirect Addressing (Continued) 


(c) Indirect addressing with index register IR1 


Mod Field Syntax Operation Description 

10000 *+ARn(IR1) addr = ARn + IR1 With preindex (IR1) add 

10001 *-ARn(IR1) addr = ARn - IR1 With preindex (IR1) subtract 

10010 *4+4+ ARn(IR1) addr = ARn + IR1 With preindex (IR1) add and modify 


ARn= ARn + IR1 


100114 *—--ARn(IR1) addr = ARn - IR1 With preindex (IR1) subtract and modify 
ARn= ARn -IR1 


10100 *ARn++(IR1) addr = ARn With postindex (IR1) add and modify 
ARn= ARn + IR1 


10101 *ARn--(IR1) addr = ARn With postindex (IR1) subtract and modify 
ARn=ARn-IR1 


10110 *ARn++(IR1)% addr =ARn With postindex (IR1) add and circular modify 
ARn = circ(ARn + IR1) 


10111 *“ARn--(IR1)%  addr=ARn With postindex (IR1) subtract and circular modify 
ARn = circ(ARn - IR1) 


(d) Indirect addressing (special cases) 


Mod Field Syntax Operation Description 

11000 *ARn addr = ARn Indirect 

11001 *ARn++(IRO)B addr = ARn With postindex (IRO) add 

ARn = B(ARn + IRO) and bit-reversed modify 
Legend: addr memory address circ() address in circular addressing 

ARn auxiliary registers ARO-AR7 ++ add and modify by one 
B where bit-reversed addressing is performed % where circular addressing is performed 
B() bit-reversed address IRn index register IRO or IR1 


-- subtract and modify by one 
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13.4.2 3-Operand Addressing Modes 
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Instructions that use the 3-operand addressing modes, such as ADDI3, LSH3, 
CMPF3, or XOR8, usually have this form: 


srci operation src2 — dst 


where the destination operand is signified by dst and the source operands by 
srci and src2; operation defines an operation to be performed. 


SS  ——  ——————_——————ll 


Note: 


The 3can be omitted from a 3-operand instruction mnemonic. 
a | 


Bits 31-29 are set to the value of 001, indicating 3-operand addressing 
mode instructions. Bits 22 and 21 specify the 3-operand addressing mode 
(T) field, which defines how bits 15-0 are to be interpreted for addressing 
the SRC operands. Bits 15-8 define the SRC1 address; bits 7-0 define the 
SRC2 address. Options for bits 22 and 21 (T) are as follows: 


T src1 addressing modes src2 addressing modes 
00 Register mode Register mode 
(any CPU register) (any CPU register) 
01 Indirect mode Register mode 
(disp = 0, 1, IRO, IR1) (any CPU register) 
10 Register mode Indirect mode 
(any CPU register) (disp = 0, 1, IRO, IR1) 
11 Indirect mode Indirect mode 
(disp = 0, 1, IRO, IR1) (disp = 0, 1, IRO, IR1) 


Figure 13-2 shows the encoding for 3-operand addressing. If the src1 and 
src2 fields both modify the same auxiliary register, both addresses are correctly 
generated. However, only the value created by the srci field is saved in the 
auxiliary register specified. The assembler issues a warning if you specify this 
condition. 
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The following values of ARn and ARm are valid: 


ARn,O<n<7 
ARm,0<m<7 


The notation modm or modn indicates the modification field that goes with the 
ARmor ARn field, respectively. Refer to Table 13-10 on page 13-22 for further 
information. 


In indirect addressing of the 3-operand addressing mode, displacements (if used) 
are allowed to be 0 or 1, and the index registers (IRO and IR1) can be used. The 
displacement of 1 is implied and is not explicitly coded in the instruction word. 


Figure 13-2. Encoding for 3-Operand Addressing Modes 


operation 


Destination 


operation 


operation 


operation 


13.4.3 Parallel Addressing Modes 


Instructions that use parallel addressing, indicated by || (two vertical bars), allow 
the most parallelism possible. The destination operands are indicated as d1 
and d2, signifying dst1 and dsi2, respectively (see Figure 13-3). The source 
operands, signified by src1 and src2, use the extended-precision registers. 
Operation refers to the parallel operation to be performed. 


Figure 13-3. Encoding for Parallel Addressing Modes 


[3130 [20 26 |25 24 [25 [22 |a1_ ta jie [15st Jao 7] 2 


|i 0 operation] PF fat a2) srt | sca | modn | ARM | modm | ARM 


The parallel addressing mode (P) field specifies how the operands are to be 
used, that is, whether they are source or destination. The specific relation- 
ship between the P field and the operands is detailed in the description of the 
individual parallel instructions. However, the operands are always encoded 
in the same way. Bits 31 and 30 are set to the value of 10, indicating parallel 
addressing mode instructions. Bits 25 and 24 specify the parallel addressing 
mode (P) field, which defines how to interpret bits 21-0 for addressing the src 
operands. Bits 21-19 define the srci address, bits 18-16 define the src2 
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address, bits 15-8 the src3 address, and bits 7-0 the src 4 address. The 
notations modn and mod” indicate which modification field goes with which 
ARnor ARm (auxiliary register) field, respectively. The following list describes 
the parallel addressing operands: 


srci =Rn 0<ns<7 (extended-precision registers RO-—R7) 
src2 = Rn 0 <n<7 (extended-precision registers RO-R7) 


d1 If 0, dst1 is RO. If 1, dsf1 is R1. 
d2 If 0, dsi2 is R2. If 1, dsi2 is R3. 
P 0<P<3 

src3 indirect (disp = 0, 1, IRO, IR1) 
src4 indirect (disp = 0, 1, IRO, IR1) 


As in the 3-operand addressing mode, indirect addressing in the parallel 
addressing mode allows for displacements of 0 or 1 and the use of the index 
registers (IRO and IR1). The displacement of 1 is implied and is not explicitly 
coded in the instruction word. 


In the encoding shown for this mode in Figure 13-3, if the src3 and srcé4 fields 
use the same auxiliary register, both addresses are correctly generated, but 
only the value created by the src3 field is saved in the auxiliary register specified. 
The assembler issues a warning if you specify this condition. 


The encoding of these parallel addressing modes has been extended in the 
following devices: 


_j ’C31 silicon revision 6.0 or greater 
J ’C2 silicon revision 2.0 or greater 


These addressing mode extensions also allow the use of any CPU register when- 
ever an indirect operand is required in src3 and/or src4 operand. Figure 13-4 
shows the encoding for extended parallel addressing instructions. 


Figure 13-4. Encoding for Extended Parallel Addressing Instructions 


[3130 20 [28 26 [2s 24 [23 | 22 [av ta lie te jis 13 |12_ ales [4 


[tO operation] Pi dt [d2 [srt | seca |i ot 1 | srs [1 1 1] sro 
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13.4.4 Conditional-Branch Addressing Modes 


Instructions using the conditional-branch addressing modes (Bcond, BcondD, 
CALLcond, DBcond, and DBcondD) can perform a variety of conditional 
operations. Bits 31-27 are set to the value of 01101, indicating conditional-branch 
addressing mode instructions. Bit 26 is set to 0 or 1; 0 selects DBcona, 1 selects 
Bcond. Selection of bit 25 determines the conditional-branch addressing mode 
(B). If B = 0, register addressing is used; if B = 1, PC-relative addressing is used. 
Bit 21 sets the type of branch: D = 0 for a standard branch or D = 1 for a delayed 
branch. The condition field (cond) specifies the condition checked to determine 
what action to take, that is, whether to branch (see Table 13-12 on page 13-30 
for a list of condition codes). Figure 13-5 shows the encoding for conditional- 
branch addressing. 


Figure 13-5. Encoding for Conditional-Branch Addressing Modes 


(a) DBcond (D) 


cond 00000000 000 src reg 
D cond immediate (PC relative) 


cond 000000 00000 src reg 


immediate (PC relative) 


cond 000000000 00 src reg 


immediate (PC relative) 
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13.5 Condition Codes and Flags 


The ’C3x provides 20 condition codes (00000-10100, excluding 01011) that 
you can place in the cond field of any of the conditional instructions, such as 
RETScond or LDFcond. The conditions include signed and unsigned compari- 
sons, comparisons to 0, and comparisons based on the status of individual 
condition flags. All conditional instructions can accept the suffix U to indicate 
unconditional operation. 


Seven condition flags provide information about properties of the result of 
arithmetic and logical instructions. The condition flags are stored in the status 
register (ST) and are affected by an instruction only when either of the following 
two cases occurs: 


(J The destination register is one of the extended-precision registers 
(R7—RO). (This allows for modification of the registers used for addressing 
but does not affect the condition flags during computation.) 


(1 The instruction is one of the compare instructions (CMPF, CMPF3, CMPI, 
CMPI3, TSTB, or TSTB3). (This makes it possible to set the condition flags 
according to the contents of any of the CPU registers.) 


The condition flags are modified by most instructions when either of the pre- 
ceding conditions is established and either of the following two cases occurs: 


(1 A result is generated when the specified operation is performed to infinite 
precision. This is appropriate for compare and test instructions that do not 
store results in a register. It is also appropriate for arithmetic instructions 
that produce underflow or overflow. 


(1 The output is written to the destination register, as shown in Table 13-11. 
This is appropriate for other instructions that modify the condition flags. 


Table 13-11. Output Value Formats 
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Type of Operation Output Format 

Floating point 8-bit exponent, one sign bit, 31-bit fraction 
Integer 32-bit integer 

Logical 32-bit unsigned integer 


Figure 13-6 shows the condition flags in the low-order bits of the status register. 
Following the figure is a list of status register condition flags with a description 
of how the flags are set by most instructions. For specific details of the effect of 
a particular instruction on the condition flags, see the description of that instruc- 
tion in Section 13.6, /ndividual Instruction Descriptions, on page 13-32. 
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Figure 13-6. Status Register 


13. 16, 15 14 13. 12,11 10 9 817 6 5 4,3 2 1 0 

ate a sate |oc ce |r| aciows| tur w]urin | z]v || 

xx + status config | GIE |CC.CE| CF] xx |RM:OVM|LUF/LV]UF.1N | Z| V|C 
'(’C32 only) | (C32 onl ' 


' R R/W R/W R/WiR/W R/W R/W) R/W R/W sR/W R/WIRIW RW R/W R/W 


Note: xx =reserved bit, read as 0 
R = read, W = write 


LUF Latched floating-point underflow condition flag. LUF is set whenever UF 
(floating-point underflow flag) is set. LUF can be cleared only by a processor 
reset or by modifying it in the status register (ST). 


LV Latched overflow conditionfFlag. LV is set whenever V (overflow condition 
flag) is set. Otherwise, it is unchanged. LV can be cleared only by a processor 
reset or by modifying it in the status register (ST). 


UF Floating-point underflow conditionflag. A floating-point underflow occurs 
whenever the exponent of the result is less than or equal to —128. If a floating- 
point underflow occurs, UF is set, and the output value is set to 0. UF is 
cleared if a floating-point underflow does not occur. 


N Negative condition flag. Logical operations assign N the state of the MSB 
of the output value. For logical operations, V is set to the state of the MSB. 
For integer and floating-point operations, N is set if the result is negative and 
cleared otherwise. A 0 is positive. 


Z Zero condition flag. For logical, integer, and floating-point operations, Z is 
set if the output is O and cleared otherwise. 


V Overflow condition flag. For integer operations, V is set if the result does 
not fit into the format specified for the destination (that is, -2 32 < result < 2 
32 _ 1). Otherwise, V is cleared. For floating-point operations, V is set if the 
exponent of the result is greater than 127; otherwise,V is cleared. Logical 
operations always clear V. 


Cc Carry flag. When an integer addition is performed, C is set if a carry occurs 
out of the bit corresponding to the MSB of the output. When an integer 
subtraction is performed, C is set if a borrow occurs into the bit corresponding 
to the MSB of the output. Otherwise, for integer operations, C is cleared. The 
carry flag is unaffected by floating-point and logical operations. For shift 
instructions, this flag is set to the last bit shifted out; for a 0 shift count, this 
is set to 0. 
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Table 13-12 lists the condition mnemonic, code, description, and flag for each 
of the 20 condition codes. 


Table 13-12. Condition Codes and Flags 
(a) Unconditional compares 
Condition Code Description Flagt 
U 00000 Unconditional lrrevelant 


(b) Unsigned compares 


Condition Code Description Flagt 

LO 00001 Lower than C 

LS 00010 Lower than or same as CORZ 

HI 00011 Higher than ~C AND ~Z 
HS 00100 Higher than or same as ~C 

EQ 00101 Equal to Z 

NE 00110 Not equal to ~Z 


(c) Signed compares 


Condition Code Description Flagt 

LT 00111 Less than N 

LE 01000 Less than or equal to NORZ 

GT 01001 Greater than ~N AND ~Z 
GE 01010 Greater than or equal to ~N 

EQ 00101 Equal to Z 

NE 00110 Not equal to ~Z 


t~= logical complement (not true condition) 
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Table 13-12. Condition Codes and Flags (Continued) 


(d) Compare to zero 


Condition Code 
Z. 00101 
NZ 00110 
P 01001 
N 00111 
NN 01010 


(e) Compare to condition flags 


Condition Code 


NN 01010 
N 001114 
NZ 00110 
Z 00101 
NV 01100 
V 01101 
NUF 01110 
UF 01114 
NC 00100 
Cc 00001 
NLV 10000 
LV 10001 
NLUF 10010 
LUF 10011 
ZUF 10100 


Description 


Zero 

Not zero 
Positive 
Negative 
Non-negative 


Description 


Non-negative 
Negative 
Nonzero 
Zero 

No overflow 
Overflow 

No underflow 
Underflow 
No carry 
Carry 


No latched overflow 

Latched overflow 

No latched floating-point underflow 
Latched floating-point underflow 
Zero or floating-point underflow 


t ~ = logical complement (not true condition) 


~UF 

UF 

~C 

Cc 

~LV 

LV 

~LUF 
LUF 

Z OR UF 
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13.6 Individual Instructions 


This section contains the individual assembly language instructions for the ’C3x. 
The instructions are listed in alphabetical order. Information for each instruction 
includes assembler syntax, operation, operands, encoding, description, cycles, 
status bits, mode bit, and examples. 


Definitions of the symbols and abbreviations, as well as optional syntax forms 
allowed by the assembler, precede the individual instruction description section. 
Also, an example instruction shows the special format used and explains its 
content. 


A functional grouping of the instructions, as well as a complete instruction set 
summary, can be found in Section 13.1 on page 13-2. Appendix A lists the 
opcodes for all of the instructions. Refer to Chapter 6 for information on 
memory addressing. Code examples using many of the instructions are 
provided in Chapter 1, Software Applications, of the TMS320C3x General- 
Purpose Applications User’s Guide. 


13.6.1 Symbols and Abbreviations 
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Table 13-13 lists the symbols and abbreviations used in the individual instruc- 
tion descriptions. 


Table 13-13. Instruction Symbols 


Symbol 


src 
srci 
src2 
src3 
src4 


dst 
dst 
dsi2 
disp 
cond 
count 
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Meaning 


Source operand 

Source operand 1 
Source operand 2 
Source operand 3 
Source operand 4 


Destination operand 
Destination operand 1 
Destination operand 2 
Displacement 
Condition 

Shift count 


General addressing modes 
3-operand addressing modes 

Parallel addressing modes 
Conditional-branch addressing modes 


Absolute value of x 

Assign the value of x to destination y 
Mantissa field (sign + fraction) of x 
Exponent field of x 


mnemonic 


Operation 1 performed in parallel with operation 2 


Bitwise-logical AND of x and y 
Bitwise-logical OR of x and y 
Bitwise-logical XOR of x and y 
Bitwise-logical complement of x 


Shift x to the left y bits 
Shift x to the right y bits 


Increment SP and use incremented SP as address 


Use SP as address and decrement SP 


Auxiliary register n 

Index register n 

Register address n 

Repeat count register 
Repeat end address register 
Repeat start address register 
Status register 
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C Carry bit 

GIE Global interrupt enable bit 
N Trap vector 

PC Program counter 

RM Repeat mode flag 

SP System stack pointer 


13.6.2 Optional Assembler Syntax 


The assembler allows a relaxed syntax form for some instructions. These 
optional forms simplify the assembly language so that special-case syntax 
can be ignored. A list of the optional syntax forms follows. 


Lj) You can omit the destination register on unary arithmetic and logical oper- 
ations when the same register is used as a source. For example, 


ABSI RO,RO can be written as ABSI RO 


Instructions affected: ABSI, ABSF, FIX, FLOAT, NEGB, NEGF, NEGI, 
NORM, NOT, RND 


(J You can write all 3-operand instructions without the 3. For example, 
ADDI3 RO,R1,R2 can be written as ADDI RO,R1,R2 


Instructions affected: ADDC3, ADDF3, ADDI3, AND3, ANDN3, ASHS, 
LSH3, MPYF8, MPYI3, OR3, SUBB3, SUBF3, SUBI3, XOR3 


This also applies to all of the pertinent parallel instructions. 


(J You can write all 3-operand comparison instructions without the 3. For 
example, 


CMPI3 RO, *ARO can be written as CMPI RO, *ARO 
Instructions affected: CMPI3, CMPF3, TSTB3 


(1 Indirect operands with an explicit 0 displacement are allowed. In 3-operand 
or parallel instructions, operands with 0 displacement are automatically 
converted to no-displacement mode. For example: 


LDI *+AR0(0),R1_ is legal. 
Also 


ADDI3 *+ARO(0),R1,R2 IS equivalent to ADDI3 *ARO,R1,R2 


(1 You can write indirect operands with no displacement, in which case a 
displacement of 1 is assumed. For example, 


LDI *ARO++(1),RO canbe written as LDI *ARO++,RO 
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All conditional instructions accept the suffix U to indicate unconditional 
operation. Also, you can omit the U from unconditional short branch 
instructions. For example: 


BU label can be written as B label 


You can write labels with or without a trailing colon. For example: 


labelo: NOP 
labell NOP 
label2: label assembles to next source line 


Empty expressions are not allowed for the displacement in indirect mode: 


LDI *+AR0() ,RO is not legal. 


You can precede long immediate mode operands (destination of BR and 
CALL) with an @ sign: 


BR label can be written as BR @label 


You can use the LDP pseudo-op to load a register (usually DP) with the 
eight most significant bits (MSBs) of a relocatable address: 


LDP addr,REG or LDP @addr,REG 
The @ sign is optional. 


If the destination register is the DP, you can omit the DP in the operand. LDP 
generates an LDI instruction with an immediate operand and a special relo- 
cation type. 


You can write parallel instructions in either order. For example: 


ADDI can be written as STI 
|| STI || ADDI 


You can write the parallel bars indicating part 2 of a parallel instruction 
anywhere on the line from column 0 to the mnemonic. For example: 


ADDI can be written as ADDI 
|| STI || STI 


If the second operand of a parallel instruction is the same as the third (desti- 
nation register) operand, you can omit the third operand. This allows you to 
write 3-operand parallel instructions that look like normal 2-operand instruc- 
tions. For example, 


ADDI *ARO,R2,R2 can be written as ADD *ARO,R2 
|| MPYI *AR1,R0,RO || MPYI *AR1,RO 


Instructions affected (applies to all parallel instructions that have a register 
second operand): ADDI, ADDF, AND, MPYI, MPYF, OR, SUBI, SUBF, 
XOR. 
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Lj) You can write all commutative operations in parallel instructions in either 
order. For example, you can write the ADDI part of a parallel instruction in 
either of two ways: 


ADDI *ARO,R1,R2 or ADDI R1,*ARO,R2 


Instructions affected include: parallel instructions containing any of the 
following: ADDI, ADDF, MPYI, MPYF, AND, OR, XOR. 


(1 Use the syntax in Table 13-14 to designate CPU registers in operands. 
Note the alternate notation Rn, 0 < n < 27, which is used to designate 
any CPU register. 
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Table 13-14. CPU Register Syntax 


Assemblers 
Syntax 


RO 
R1 
R2 
R3 
R4 
R5 
R6 
R7 


ARO 
AR1 
AR2 
AR3 
AR4 
ARS 
AR6 
AR7 


IRO 
IR1 


Running Title—Attribute Reference mnemonic 


Alternate 
Register Syntax 


RO 
Ri 
R2 
R3 
R4 
R5 
R6 
R7 


R8 
RQ 
R10 
Ri1 
R12 
R13 
R14 
R15 


R16 
R17 
R18 
R19 
R20 


R21 
R22 
R23 
R24 


R25 
R26 
R27 


Assigned Function 


Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 


Extended-precision register 


Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 


Auxiliary register 


Data-page pointer 
Index register 0 
Index register 1 
Block-size register 


Active stack pointer 


Status register 

CPU/DMA interrupt enable 
CPU interrupt flags 

/O flags 


Repeat start address 
Repeat end address 


Repeat counter 
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13.6.3 Individual Instruction Descriptions 


Each assembly language instruction for the ’C3x is described in this section 
in alphabetical order. The description includes the assembler syntax, operation, 
operands, encoding, description, cycles, status bits, mode bit, and examples. 
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Syntax 


Operation 


Operands 


Example Instruction EXAMPLE 


INST src, dst 
or 


INST1 src2, dst1 
|| INST2 src3, dst2 


Each instruction begins with an assembler syntax expression. You can place 
labels either before the command (instruction mnemonic) on the same line or 
on the preceding line in the first column. The optional comment field that con- 
cludes the syntax is not included in the syntax expression. Space(s) are 
required between each field (label, command, operand, and comment fields). 


The syntax examples illustrate the common one-line syntax and the two-line 
syntax used in parallel addressing. Note that the two vertical bars || that indicate 
a parallel addressing pair can be placed anywhere before the mnemonic on the 
second line. The first instruction in the pair can have a label, but the second 
instruction cannot have a label. 


|src | > dst 
or 


|src2 | > dst? 
|| src3 > dsit2 


The instruction operation sequence describes the processing that occurs when 
the instruction is executed. For parallel instructions, the operation sequence is 
performed in parallel. Conditional effects of status-register-specified modes are 
listed for such conditional instructions as Bcond. 


src general-addressing modes (G): 


00 register (Rn, O< n< 27) 
01 direct 

10 — indirect 

11 immediate 


dst register (Rn, O< n< 27) 
or 


src2__ indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn1,0<1n1 <7) 
src3_ register (Rn2,0 < n2 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


Operands are defined according to the addressing mode and/or the type 
of addressing used. Note that indirect addressing uses displacements and 
the index registers. See Chapter 6 for detailed information on addressing. 


Assembly Language Instructions 13-39 


EXAMPLE 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-40 


Example Instruction 


31 24 23 1615 8 7 0 


or 


3 24 23 1615 8 7 0 


‘ 
INST | INST2 ast. [0 0 0 src3 dst2 src2 


Encoding examples are shown using general addressing and parallel addressing. 
The instruction pair for the parallel addressing example consists of INST1 and 
INST2. 


Instruction execution and its effect on the rest of the processor or memory con- 
tents is described. Any constraints on the operands imposed by the processor 
or the assembler are discussed. The description parallels and supplements 
the information given by the operation block. 


1 

The digit specifies the number of cycles required to execute the instruction. 

LUF _Latched floating-point underflow condition flag. 1 if a 
floating-point underflow occurs; unchanged otherwise. 


LV Latched overflow condition flag. 1 if an integer or floating-point 
overflow occurs; unchanged otherwise. 


UF Floating-point underflow condition flag. 1 if a floating-point underflow 
occurs; 0 otherwise. 


N Negative condition flag. 1 if a negative result is generated; 0 other- 
wise. In some instructions, this flag is the MSB of the output. 


Z Zero condition flag. 1 if a 0 result is generated; 0 otherwise. For logical 
and shift instructions, 1 if a 0 output is generated; 0 otherwise. 

Vv Overflow condition flag. 1 if an integer or floating-point overflow 
occurs; 0 otherwise. 

Cc Carry flag. 1 if a carry or borrow occurs; 0 otherwise. For shift instruc- 


tions, this flag is set to the value of the last bit shifted out; 0 for a shift 
count of 0. 


The seven condition flags stored in the status register (ST) are modified by the 
majority of instructions only if the destination register is R7—RO. The flags pro- 
vide information about the properties of the result or the output of arithmetic 
or logical operations. 


OVM Overflow mode flag. In general, integer operations are affected by the 
OVM bit value (described in Table 3-2 on page 3-6). 


Example 


Example Instruction EXAMPLE 


INST @98AEh,R5 


R5 


R5 decimal 


Data memory 
8098AEh 
0200h 


Before Instruction After Instruction 
R5 
RS decimal 

080 DP 080 
[ WWF [| 
[ A | 
[fe = = uw [Lo] 
——we 1 NC] 
[LO] z[_ 
———er vE J 
c= = sai) [= =a] 
B098AEN 
0200h 


The sample code presented in the above format shows the effect of the code on 
system pointers (for example, DP or SP), registers (for example, R1 or R85), 
memory at specific locations, and the seven status bits. The values given for the 
registers include the leading Os to show the exponent in floating-point operations. 
Decimal conversions are provided for all register and memory locations. The 
seven status bits are listed in the order in which they appear in the assembler and 
simulator (see Section 13.5 on page 13-28 and Table 13-12 on page 13-30 for 
further information on these seven status bits). 
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13-41 


ABSF Absolute Value of Floating Point 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 
Example 
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ABSF src, dst 
|src| > dst 
src general addressing modes (G): 


00 register (Rn,O<n<7) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn,O< n<7) 


31 15 


24.23 16 87 0 


The absolute value of the src operand is loaded into the dst register. The src 
and dst operands are assumed to be floating-point numbers. 


An overflow occurs if src (man) = 80000000h and src (exp) = 7Fh. The result 
is dst (man) = 7FFFFFFFh and dst (exp) = 7Fh. 


{ 
These condition flags are modified only if the destination register is R7—RO. 
LUF Unaffected 


LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 0 
N 0 
Z 1 if a O result is generated; 0 otherwise 
Vv 1 if a floating-point overflow occurs; 0 otherwise 
Cc Unaffected 
OVM = Operation is not affected by OVM bit value. 
ABSF R4,R7 
Before Instruction After Instruction 


R4 5C 8000 F971|-9.90337307e+27 R4 5C 8000 F971] -9.90337307e+27 
R7 7D 2511 OOAE| 5.48527255e+37 R7 5C _7FFF 068F}] 9.90337307e+27 


Lue Lo LurL_ 
wl wll 
a UF[ 
NL] NE 
2 ee Z| 
vo v[~ 
es | eT) 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Parallel ABSF and STF_ ABSF||STF 


ABSF src2, dst1 
|| STF — src3, dst2 


|src2 | > dst? 
| src3— dst2 


src2__ indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn1,0< ni <7) 
src3_ register (Rn2, 0 < n2<7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


This instruction’s operands have been augmented in the following devices: 


_j ’C31 silicon revision 6.0 or greater 
_j ’C32 silicon revision 2.0 or greater 
LV ’VC33 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rni,0<n1 <7) 
src3_ register (Rn2, 0 < n2 <7) 

indi isp = 0, t+ tROART 


3 24 23 1615 8 7 0 


1 
+ 1fo 010 0} ast Jo 0] src3 dsi2 src2 


A floating-point absolute value and a floating-point store are performed in paral- 
lel. All registers are read at the beginning and loaded at the end of the execute 
cycle. If one of the parallel operations (STF) reads from a register and the opera- 
tion being performed in parallel (ABSF) writes to the same register, STF accepts 
the contents of the register as input before it is modified by the ABSF. 


If src2 and dsit2 point to the same location, src2 is read before the write to dst2. 
If src3 and dst7 point to the same register, src3 is read before the write to dst7. 


An overflow occurs if src (man) = 80000000h and src (exp) = 7Fh. The result 
is dst (man) = 7FFFFFFFh and dst (exp) = 7Fh. 


1 


LUF Unaffected 
LV 1 if a floating-point overflow occurs; unchanged otherwise 


1 if a floating-point overflow occurs; 0 otherwise 


U 
N 
Z 1 if a O result is generated; 0 otherwise 
Vv 
Cc Unaffected 
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ABSF||STF Parallel ABSF and STF 


Mode Bit OVM = Operation is not affected by OVM bit value. 
Example ABSF *44—R3(IR1) ,R4 
|| STF R4, *-AR7 (1) 
Before Instruction After Instruction 
R4 1.79750e+02 R4 6.118750e+01 
AR3 ARS 
AR7 AR7 
IR1 IR1 OAF 
wuF[ uF [ol 
vt wf 
UF[ uF [oo] 
N[C N Lo] 
zt z Lo 
‘C4 _o_ 3 
7) c [To] 
Data Memory 
8098AF [_——5884000]-6.118750e+01 8098AF [ _58B4000]-6.118750e+01 
sossca [—~—~—=izd 8098C4 1.79750e+02 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Absolute Value of Integer 


ABSI src, dst 


|src| — dst 


src general addressing modes (G): 


any CPU register 

direct 

indirect (disp = 0-255, IRO, IR1) 
immediate 


dst any CPU register 


31 


24 23 1615 8 7 


ABSI 


0 


The absolute value of the src operand is loaded into the dst register. The src 
and dst operands are assumed to be signed integers. 


An overflow occurs if src = 80000000h. If ST(OVM) = 1, the result is 


dst = 7FFFFFFFh. If ST(OVM) = 0, the result is dst = 80000000h. 


1 


These condition flags are modified only if the destination register is R7—RO. 


LUF 
LV 


Unaffected 

1 if an integer overflow occurs; unchanged otherwise 
0 

0 

1 if a O result is generated; 0 otherwise 

1 if an integer overflow occurs; 0 otherwise 
Unaffected 


Operation is affected by OVM bit value. 
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ABSI Absolute Value of Integer 


Example 1 ABSI RO,RO 
or 
ABST RO 
Before Instruction After Instruction 
RO -53 Ro 53 
Example 2 ABSI *AR1,R3 
Before Instruction After Instruction 
RS R3 53 
ARI ARI 
Data memory 
20 -53 20 -53 
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Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Parallel ABSI and ST! ABSI||STI 


ABSI _ src2, dst 
|| STI src3, dst2 


|src2 | > dst? 
| src3— dst2 


src2 indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn1,0<1<7) 

src3. register (Rn2,0<n2<7) 
dst2 indirect (disp =0, 1, IRO, IR1) 


This instruction’s operands have been augmented in the following devices: 


_j ’C31 silicon revision 6.0 or greater 
LJ ’C32 silicon revision 2.0 or greater 
LV ’VC33 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rni, 0 <n1 <7) 

src3_ register (Rn2, 0 < n2 <7) 

dst2— indirect (disp = 0, t+ RO, tRt) 


31 24 23 1615 8 7 0 
BBE 010 1 ast [0 0 9) src3 dsi2 src2 


An integer absolute value and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute cycle. 
If one of the parallel operations (STI) reads from a register and the operation 
being performed in parallel (ABSI) writes to the same register, STI accepts the 
contents of the register as input before it is modified by the ABSI. 


If src2 and dst2 point to the same location, src2 is read before the write to dst2. 


An overflow occurs if src = 80000000h. If ST(OVM) = 1, the result is dst = 
7FFFFFFFh. If ST(OVM) = 0, the result is dst = 80000000h. 


1 


Assembly Language Instructions 13-47 


ABSI||STI  Paralle! ABS! and STI 


Status Bits These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 
N 0 
Z 1 if a O result is generated; 0 otherwise 
V 1 if an integer overflow occurs; 0 otherwise 
Cc Unaffected 
Mode Bit OVM Operation is affected by OVM bit value. 
Example ABSI *-AR5(1),R5 
| | STI R1,*AR2--(IR1) 
Before Instruction After Instruction 
Rt 66 Rt 66 
R5 RS 53 
AR2 AR2 
ARS ARS 
IR1 IR1 
wr [| uF [ol 
vl vw Co 
UF[__ uP [__ 
NL] N Ll 
2 2. a | 
2 . (2s 
cL ol c Lo 
Data memory 
8098FF 253 8098FF 53 
8099E1 2 8099E1 66 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


Add Integer With Carry ADDC 


ADDC src, dst 
dst + src+ C > dst 
src general addressing modes (G): 


00 any CPU register 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst any CPU register 


31 2423 1615 87 0 


The sum of the dst and src operands and the carry (C) flag is loaded into the 
dst register. The dst and src operands are assumed to be signed integers. 


1 


These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if accarry occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 


ADDC R1,R5 

Before Instruction After Instruction 

R1 00 FFFF 5C25] -41,947 R1 00 FFFF 5C25| -41,947 

R5 00 FFFF 019E] -65,122 R5 00 FFFE 5DC4| -107,068 
tur [I uF Ld 
wool iv Lo 
UF [| UF [ 
NCL] N[ 
zL z{_ 
v[~ v~> 
cL] cL 
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ADDC3 Aad Integer With Carry, 3-Operand 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 
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ADDC3 © src2, src1, dst 
src1 + src2+C — dst 
src? 3-operand addressing modes (T): 


00 any CPU register 
01 indirect (disp = 0, 1, IRO, IR1) 
10 any CPU register 
11 indirect (disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (1): 


00 any CPU register 
01 any CPU register 
10 _ indirect (disp = 0, 1, IRO, IR1) 
11 indirect (disp = 0, 1, IRO, IR1) 


dst any CPU register 


31 2423 1615 8 7 0 


The sum of the src? and src2 operands and the carry (C) flag is loaded into 
the dst register. The src1, src2, and dst operands are assumed to be signed 
integers. 


{ 


These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

1 if an integer overflow occurs; unchanged otherwise 
0 

1 if a negative result is generated; 0 otherwise 

1 if a O result is generated; 0 otherwise 

1 if an integer overflow occurs; 0 otherwise 

1 if a carry occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 


N > 
Qo< =CeE 


Add Integer With Carry, 3-Operand ADDC3 


Example 1 ADDC3 *AR5++(IRO),R5,R2 
or 
ADDC3 R5, *AR5++(IRO) ,R2 
Before Instruction After Instruction 
R2 R2 50 
R5 102 R5 102 
ARS ARS 
IRO IRO 
FL uF Ld 
wil i a) 
U[ UP [ed 
NC NL 
z[_ zLd 
vE_ v [ 
c C 
Data memory 
809908 [____ OF FFFFFCB] -53 809908 “53 
Example 2 ADDC3 R2, R7, RO 
Before Instruction After Instruction 
RO RO 4671 
R2 700 R2 700 
R7 3970 R7 3970 
LFL wr [od 
2 wf 
UPL UF [___ 
NL n Ll 
zL_ z[L 
v~E v CL 
C cL 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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ADDF Add Floating-Point Values 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-52 


ADDF ssrc, dst 
dst + src — dst 
src general addressing modes (G): 
00 register (Rn,O<n<7) 
01 direct 
10 indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn,O0<n<7) 


31 2423 1615 8 7 0 


The sum of the dst and src operands is loaded into the dst register. The dst and 
src operands are assumed to be floating-point numbers. 


{ 


These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if a floating-point overflow occurs; 0 otherwise 


Cc Unaffected 
OVM = Operation is not affected by OVM bit value. 


Add Floating-Point Values ADDF 


Example ADDF *AR4++(IR1),R5 
Before Instruction After Instruction 
R5L_os zes0 000d ¢ 53759001 R5 5.3268750e+02 
aRL_____azooaod] AR4 
IRL taal IR1 
LucL__ Ltur[_. dj 
wld wold 
urL__ upL...._d 
nL NL 
zL_ zo 
vL_ dd vE dd 
cL cfd 


Data memory 


809800 86B2800)4.7031250e+02 809800 86B2800) 4.7013250e+02 
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ADDF3 = Aad Floating Point, 3-Operand 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 
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ADDF3 src2, src1, dst 

src1 + src2 > dst 

src? 3-operand addressing modes (T): 
00 register (Rn1, 0<n1 <7) 
01 _ indirect (disp = 0, 1, IRO, IR1) 
10 register (Rn1,0<n1 <7) 
11. indirect (disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (T): 


00 register (Rn2, 0 < n2<7) 
01 register (Rn2, 0 < n2 <7) 
10 _ indirect (disp = 0, 1, IRO, IR1) 
11 indirect (disp = 0, 1, IRO, IR1) 


dst register (Rn, 0 <n<7) 


31 2423 1615 8 7 0 
oo 1/0 00001 dst srci src2 


The sum of the src? and src2 operands is loaded into the dst register. The src7, 
src2, and dst operands are assumed to be floating-point numbers. 


{ 


These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if a floating-point overflow occurs; 0 otherwise 


Cc Unaffected 
OVM = Operation is not affected by OVM bit value. 


Add Floating Point, 3-Operand ADDF3 


Example 1 ADDF3 R6,R5,R1 
or 
ADDF3 R5,R6,R1 


Before Instruction After Instruction 


R1 00 0000 0000 R1 09 052C 0000]5.3268750e+02 
R5 05 7980 0000| 6.23750e+01 R5 | 05 7980 0000|6.23750e+01 
R6 08 6B28 0000|4.7031250e+02 R6 08 6B28 0000]4.7031250e+02 


| tur a) 
wl 9] wl 9] 
ur ur Ld 
C9] ‘Co 
za) zo 
vo vo 
cla) cf] 

Example 2 ADDF3 *+AR1 (1) ,*AR7++(IRO) ,R4 


Before Instruction 


After Instruction 


R4 R4 1.41695313e+02 
ARI ARI 
AR7 AR7 
IRO IRO 
LuF[_ wuFL__ 
wl wl 
UFL__ wl 
| NL 
zL_ Alc ___=i] 
v~E vL__ 
c[ cL 


Data memory 


g09s2th [______—700F000]1.28940e+02 809821h 1.28940e+02 
go99F0h [_34c2000)1.27590e+01 s099FOh [___34¢2000]1.27590e+01 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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ADDF3]||STF Parallel ADDF3 and STF 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 
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ADDFS3 ~ src2, src?, dst1 
|| STF src3, dst2 


src? + src2 — dst1 
|| src3 > dst2 


src? register (Rn1,0<n1 <7) 
src2__ indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn2, 0 < n2 <7) 
src3_ register (Rn8, 0 < n3 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


This instruction’s operands have been augmented in the following devices: 


(J ’C31 silicon revision 6.0 or greater 
_j ’C32 silicon revision 2.0 or greater 
Lj ’VC33 


src? register (Rn1,O0<n1 <7) 
src2 indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn2,0< n2<7) 
src3_ register (Rn38, 0 < n3 <7) 


dst2—indirect {disp —0, + 4ROARY 


31 24 23 1615 8 7 0 


1 1 fo 011 0) ast srt | src3 dsi2 src2 


A floating-point addition and a floating-point store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute cycle. 
If one of the parallel operations (STF) reads from a register and the operation 
being performed in parallel (ADDF3) writes to the same register, STF accepts 
as input the contents of the register before it is modified by the ADDF3. 


If src2 and dst2 point to the same location, src2 is read before the write to dst2. 
1 
These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
1 if a floating-point overflow occurs; unchanged otherwise 
1 if a floating-point underflow occurs; 0 otherwise 

1 if a negative result is generated; 0 otherwise 

1 if a O result is generated; 0 otherwise 

1 if a floating-point overflow occurs; 0 otherwise 

U 


naffected 


QO<N2ZC 


Parallel ADDF3 and STF_ ADDF3\||STF 


Mode Bit OVM Operation is not affected by OVM bit value. 
Example ADDF3 *+AR3 (IR1) ,R2,R5 
| | STF R4,*AR2 
Before Instruction After Instruction 

R2 1.4050e+02 R2 1.4050e+02 
R4 6 .281250e+01 R4 6.281250e+01 
R5 RS 3.20250e+02 

AR2 AR2 

ARS ARS 

IR1 OAS IR1 

uF [ol lwFL__ 

fe | wl 

uF[ ul 
N[ NL] 
zL_ zi 
vL_ vL 
cL | eT) 

Data memory 
8098A5h 1.79750e+02 8098A5h 1.79750e+02 
goosFsh [si 8098F3h 6 .28125e+01 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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ADDI Aad Integer 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 
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ADDI src, dst 
dst + src > dst 
src general addressing modes (G): 


00 any CPU register 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst any CPU register 


31 2423 1615 8 7 0 


The sum of the dst and src operands is loaded into the the dst register. The 
dst and src operands are assumed to be signed integers. 


{ 


These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a carry occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 


ADDI R3,R7 

Before Instruction After Instruction 

RS [00 RFF FFCB] -53 R3 -53 
R7 53 R7 
LuF[_ LuF [__ 
ve sid iw fo 
uF[ uF [ol 
N[ NL 
zC 4 zd 
vL ol v Ll 
cL c[ 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Add Integer, 3-Operand ADDI3 


ADDI3 <src2 >,<srcl >,<dst > 

sc? + src2 > dst 

src? 3-operand addressing modes (T): 
00 any CPU register 
01 _ indirect (disp = 0, 1, IRO, IR1) 
10 any CPU register 
11 indirect (disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (T): 


00 any CPU register 

01 any CPU register 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, IRO, IR1) 


dst any CPU register 


31 24.23 1615 87 0 


The sum of the src? and src2 operands is loaded into the dst register. The src7, 
src2, and dst operands are assumed to be signed integers. 


1 


These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a carry occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 
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ADDI3 = Add Integer, 3-Operand 


Example 1 ADDI3 R4,R7,R5 
Before Instruction After Instruction 
Ra 220 Ra 220 
R5 16 R5 [_00 0000 017¢] 380 
R7 160 R7 160 
LuF[ Lur [_ 
vl i 
UPL UF [df 
NL N [gd 
zL_ zL_ "dd 
VE YE al 
c Ld c Ld 
Example 2 ADDI3 *-AR3 (1) ,*AR6--(IRO) ,R2 
Before Instruction After Instruction 
R2 16 R2 [00 0000 6598 ]26,000 
AR3 AR3 
AR6 ARG 
IRO IRO 
wwFL_ LuF[__ 
vo 4 i |__ — a] 
uF wl 9] 
NL 9] NL] 
Z[-. _ _ 2122 —— — al 
iy Lal vL_ J 
cL ea] 
Data memory 
809801 11,000 809801 11,000 
809930 15,000 809930[__—sé3:98 J 15,000 
Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Parallel ADDI3 and ST! ADDI3\||STI 


ADDI3 — src2, src1, dst? 
|| STI src3, dst2 


src? + src2 — dst? 
|| src3 — dst2 


src1_ register (Rn1i,0<n1 <7) 
src2__ indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn2, 0 < n2 <7) 
src3_ register (Rn8, 0 < n3 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


This instruction’s operands have been augmented in the following devices: 


_j ‘C31 silicon revision 6.0 or greater 
J ’C2 silicon revision 2.0 or greater 
 ’VC33 


src1_ register (Rn1i,0<n1 <7) 

src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn2,0< n2<7) 

src3_ register (Rn8, 0 < n3 < 7) 

dst2 indirect telise =O 1 1ROARH 


31 2423 1615 8 7 0 


An integer addition and an integer store are performed in parallel. All registers 
are read at the beginning and loaded at the end of the execute cycle. If one 
of the parallel operations (STI) reads from a register and the operation being 
performed in parallel (ADDI3) writes to the same register, ST| accepts the con- 
tents of the register as input before it is modified by the ADDIS3. 


If src2 and dst2 point to the same location, src2 is read before the write to d'st2. 
1 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a carry occurs; 0 otherwise 


Assembly Language Instructions 13-61 


ADDI3||STI = Parallel ADDI3 and STI 


Mode Bit OVM Operation is affected by OVM bit value. 
Example ADDI3 *ARO-—(IRO) ,R5,RO 
| STI R3, *AR7 


Cc 

Data memory 
80992C 
80983B 


Note: 


Before Instruction 
00 0000 0000 


00 0000 0035 
00 0000 OODC 
80 992C 
80 983B 


{eo} 


53 


220 


12C] 300 


LU 


Cycle Count 


After Instruction 


[00 0000 0208] 520 
[00 0000 0035) 53 
[00 0000 cope] 220 
[a0 9920] 
[0 9838] 
Ld 

[ 
SS 
Ld 
—a 
eT) 
eT) 
[2c] 300 
eS 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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Syntax 
Operands 
Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


Bitwise-Logical AND 


AND src, dst 
dst AND src > dst 
src general addressing modes (G): 


00 any CPU register 

01. direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate (not sign extended) 


dst any CPU register 


15 8 7 


31 24 23 16 


AND 


0 


The bitwise-logical AND between the dst and src operands is loaded into the 
dst register. The dst and src operands are assumed to be unsigned integers. 


1 


These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output 

Z 1 if a O result is generated; 0 otherwise 
Vv 0 

Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


AND R1,R2 

Before Instruction After Instruction 

Rt Rt 
R2 R2 
LuFL_ wwF Lo 
wl vw [oo 
UPL UF [ 
NL ol NCL 
ee | zCL 
vL__ v Ld 
c C 


Assembly Language Instructions 
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AND3 _ Bitwise-Logical AND, 3-Operand 


Syntax AND3 src2, src1, dst 
Operation src? AND src2 > dst 
Operands src? 3-operand addressing modes (T): 


00 any CPU register 
01 indirect (disp = 0, 1, IRO, IR1) 
10 any CPU register 
11. indirect (disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (1): 


00 any CPU register 
01 any CPU register 
10 _ indirect (disp = 0, 1, IRO, IR1) 
11 indirect (disp = 0, 1, IRO, IR1) 


Opcode 
31 24 23 1615 87 ) 
Description The bitwise-logical AND between the src7 and src2 operands is loaded into 
the destination register. The src7, src2, and dst operands are assumed to be 
unsigned integers. 
Cycles 1 
Status Bits These condition flags are modified only if the destination register is R7—-RO. 
LUF Unaffected 
LV Unaffected 
UF 0 
N MSB of the output 
Z 1 if a O result is generated; 0 otherwise 
Vv 0 
Cc Unaffected 
Mode Bit OVM = Operation is not affected by OVM bit value. 
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Bitwise-Logical AND, 3-Operand AND3 


Example 1 AND3 *ARO--(IRO) ,*+AR1,R4 
Before Instruction After Instruction 
Ra Ra 
ARO ARO 
ARI ARI 
IRO IRO 
LuF[ LuF [i] 
wl wv [oo] 
uC ue [i] 
N[T N CL 
zt z Lo] 
v~E v [a] 
cL ¢ Co] 
Data memory 
8098F4h 8098F4h 
809952h 123 809952h 123 
Example 2 AND3 *-AR5,R7,R4 
Before Instruction After Instruction 
RA RA 
ARS ARS 
iUF [= LurF [_ 
wl w [oo 
ui u [io 
NC N Cd 
zl z Ld 
VEL v Lol 
cL ec Lo 
Data memory 
80985Bh 80985Bh 
Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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AND3||STI 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


13-66 


Parallel AND3 and STI 


ANDS3 §src2, src1, dst1 


| STI src3, dst2 

src? AND src2 > dst1 
|| src3 — dst2 
src? register (Rni,0<n1 <7) 
src2__ indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn2,0 <n2 <7) 
src3_ register (Rn8,0 <n3 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


srct 
src2 
dst1 
src3 


This instruction’s operands have been augmented in the following devices: 


_j ’C31 silicon revision 6.0 or greater 
_J ’C32 silicon revision 2.0 or greater 
Lj ‘VC33 


register (Rn1, 0< n1 <7) 
indirect (disp = 0, 1, IRO, IR1) or any CPU register 
register (Rn2, 0 < n2 <7) 
register (Rn38, 0 < n3 <7) 


31 


dst2 


indirect{eisp=04+ IROARH 


Z 


24 23 1615 8 0 


A bitwise-logical AND and an integer store are performed in parallel. All regis- 
ters are read at the beginning and loaded at the end of the execute cycle. If 
one of the parallel operations (STI) reads from a register and the operation be- 
ing performed in parallel (AND3) writes to the same register, STI accepts the 
contents of the register as input before it is modified by the AND3. 


If src2 and dsi2 point to the same location, src2 is read before the write to dst2. 


{ 


Running Title—Attribute Reference mnemonic 


Status Bits These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 

N MSB of the output 

Z 1 if a O result is generated; 0 otherwise 
Vv 0 

Cc Unaffected 


Chapter Title—Attribute Reference 13-67 


AND3||STI Parallel! AND3 and STI 


Mode Bit OVM Operation is not affected by OVM bit value. 
Example AND3 *+AR1(IRO),R4,R7 
| | STI R3,*AR2 
Before Instruction After Instruction 
RO RO 
R3 53 R3 53 
R4 R4 
R7 R7 
ARI ARI 
AR2 AR2 
LuF[ DF uur (_d 
1 u [SD 
nL N[ 
z— 7 zd 
vE_ vd 
cL cL 
Data memory 
8099F9h 8099F9h 
sosssFhL___—sss—~Cid: 80983Fh 53 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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Bitwise-Logical AND With Complement ANDN 


Syntax ANDN src, dst 
Operation dst AND ~src > dst 
Operands src general addressing modes (G): 


00 any CPU register 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate (not sign extended) 


dst any CPU register 


Opcode 
31 24 23 1615 87 0 
pooooorsoe] ow | ve 

Description The bitwise-logical AND between the dst operand and the bitwise-logical com- 
plement (~) of the src operand is loaded into the dst register. The dst and src 
operands are assumed to be unsigned integers. 

Cycles 1 

Status Bits These condition flags are modified only if the destination register is R7-RO. 
LUF Unaffected 
LV Unaffected 
UF 0 
N MSB of the output 
Z 1 if a O result is generated; 0 otherwise 
Vv 0 
Cc Unaffected 

Mode Bit OVM Operation is not affected by OVM bit value. 
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ANDN _Bitwise-Logical AND With Complement 


Example ANDN @980Ch, R2 


Before Instruction 


R2 
DP 
LuF[ 
w[ 
UF[__ 
NL 
ZL 
VL 
cL 
Data memory 


80980Ch 
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R2 
DP 
LUF 
LV 
UF 


QO < N Z 


80980Ch 


After Instruction 
00 0000 042D 


080 


[Ld 
Ld 
Ll 
[Ld 
Ld 
Ld 
[dl 


OA02 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Bitwise-Logical ANDN, 3-Operand ANDN3 


ANDN3 src2, src1, dst 
src? AND ~src2 > dst 
src? 3-operand addressing modes (T): 


00 any CPU register 
01 indirect (disp = 0, 1, IRO, IR1) 
10 any CPU register 
11 indirect (disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (T): 


00 any CPU register 

01 any CPU register 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, lOO, IR1) 


dst register (Rn, 0 <n<27) 


31 24 23 1615 8 7 0 


The bitwise-logical AND between the src? operand and the bitwise-logical 
complement (~) of the src2 operand is loaded into the dst register. The src7, 
src2, and dst operands are assumed to be unsigned integers. 


1 


These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output 

Z 1 if a O result is generated; 0 otherwise 
Vv 0 

Cc Unaffected 


OVM Operation is not affected by OVM bit value. 
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ANDN3 _ Bitwise-Logical ANDN, 3-Operand 


Example 1 ANDN3 R5,R3,R7 
Before Instruction After Instruction 
R3 R3 
R5 R5 
R7 R7 
wuF[ LF [| 
vf wv [ 
ue SS u [dd 
NC nC dg 
zd z Ld 
vL__ vEL 
cL c [Ld 
Example 2 ANDN3 R1,*AR5++(IRO) ,RO 
Before Instruction After Instruction 
RO RO 
Rt Rt 
ARS AR5 
IRO IRO 
LuF[ LF [ 
vl vf 
uF u [” 
vO nC q 
zl zL 
v~E vbEL 
cL cL 
Data memory 
809825h 809825h 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Arithmetic Shift ASH 


ASH count, dst 


If (count = 0): 
dst << count > dst 


Else: 
dst >> |count | > dst 


count general addressing modes (G): 


00 any CPU register 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst any CPU register 


31 24 23 1615 8 7 0 


The seven LSBs of the count operand are used to generate the 2s-comple- 
ment shift count of up to 32 bits. 


If the count operand is greater than 0, the dst operand is left shifted by the value 
of the count operand. Low-order bits that are shifted in are zero filled, and high- 
order bits are shifted out through the carry (C) bit. 


Arithmetic left shift: 
Ccdstc 0 


If the count operand is less than 0, the dst operand is right shifted by the abso- 
lute value of the count operand. The high-order bits of the dst operand are sign- 
extended as it is right shifted. Low-order bits are shifted out through the C bit. 


Arithmetic right shift: 
sign of dst > dst > C 


If the count operand is 0, no shift is performed, and the C bit is set to 0. The 
count and dst operands are assumed to be signed integers. 


1 
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ASH _ = Arithmetic Shift 


Status Bits These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 
N MSB of the output 
Z 1 if a O result is generated; 0 otherwise 
Vv 1 if an integer overflow occurs; 0 otherwise 
Cc Set to the value of the last bit shifted out; 0 for a shift count of 0 
Mode Bit OVM Operation is not affected by OVM bit value. 
Example 1 ASH R1,R3 
Before Instruction After Instruction 
Rt 16 Rt 
R3 R3 
LuF[ LUF 
v{[_ LV 
UPL UF 
N[_ N 
zd z 
vL CD v 
cL C 
Example 2 ASH @98C3h,R5 
Before Instruction After Instruction 
R5 R5 
DP DP 
LurF[_ LurF [_ 
v{[ vf 
uFL___ uF [__ 
nC N 
zDD zg 
VD 1 ——— ~=9l 
cL C 


Data memory 


8098C3h -24  sogecsh [_orres]-24 
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Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Arithmetic Shift, 3-Operand ASH3 


ASH3 count, src, dst 
If (count = 0): 
src << count > dst 


Else: 
src >> |count | > dst 


count 3-operand addressing modes (T): 


00 register (Rn2, 0 < n2< 27) 
01 register (Rn2,0< n2< 27) 
10 _ indirect (disp = 0, 1, IRO, IR1) 
11. indirect (disp = 0, 1, IRO, IR1) 


src 3-operand addressing modes (T): 


00 register (Rni,O<n1 < 27) 
01 indirect (disp = 0, 1, IRO, IR1) 
10 register (Rn1,0<n1 < 27) 
11 indirect (disp = 0, 1, IRO, IR1) 


dst register (Rn, 0 <n<27) 


31 24 23 1615 8 7 0 


The seven LSBs of the count operand are used to generate the 2s-comple- 
ment shift count of up to 32 bits. 


If the count operand is greater than 0, the src operand is left shifted by the value 
of the count operand. Low-order bits that are shifted in are zero filled, and high- 
order bits are shifted out through the status register’s C bit. 


Arithmetic left shift: 
Cesre<cO0 


If the count operand is less than 0, the src operand is right shifted by the abso- 
lute value of the count operand. The high-order bits of the src operand are sign 
extended as they are right shifted. Low-order bits are shifted out through the 
C (carry) bit. 
Arithmetic right shift: 

sign of src > src > C 


If the count operand is 0, no shift is performed, and the C bit is set to 0. The 
count, src, and dst operands are assumed to be signed integers. 


1 
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ASH3 = Arithmetic Shift, 3-Operand 


Status Bits These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 
N MSB of the output 
Z 1 if a O result is generated; 0 otherwise 
V 1 if an integer overflow occurs; 0 otherwise 
Cc Set to the value of the last bit shifted out; 0 for a shift count of 0 
Mode Bit OVM Operation is not affected by OVM bit value. 
Example 1 ASH3 *AR3-—(1),R5,RO0 
Before Instruction After Instruction 
RO RO 
RS R5 
AR3 AR3 
wr[ wr [dd 
v[ vw Lo 
UF[_ uF [__ 
Nj N Cg 
z— 9] z Ld 
vL_ v [> 
cL cL 
Data memory 
809921h 16 809921h 16 
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Arithmetic Shift, 3-Operand ASH3 


Example 2 ASH3 R1,R3,R5 

Before Instruction After Instruction 

Ri -8 Rt -8 
R3 R3 
R5 R5 00 FFFF FFCB 
2 | uF [_ 
vt w [od 
ud u [dj 
NC _q N 
zo z Ld 
v[E vE_ 
cL c [Td 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8.5.2 for the effects of 
operand ordering on the cycle count. 
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ASH3||STI 


Syntax 


Operation 


Operands 


Opcode 


Description 


13-78 


Parallel ASH3 and STI 


ASH3_ count, src2, dst1 
|| STI src3, dst2 


If (count = 0): 
src2 << count > dst1 
Else: 


src2 >> |count| > dst1 
|| src3 > dst2 


count register (Rn1,0<n1 <7) 


src2__ indirect 
dst? register 
src3 register 
dst2 indirect 


— 


disp = 0, 1, IRO, IR1) 
Rn2,0 <n2 <7) 
Rn3,0 <1n3 <7) 
disp = 0, 1, IRO, IR1) 


—~ ~ a 


This instruction’s operands have been augmented in the following devices: 


_j ’C31 silicon revision 6.0 or greater 
_j ’C32 silicon revision 2.0 or greater 
LJ ’VC33 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn1,0<1n1 <7) 

src3_ register (Rn2, 0 < n2 <7) 

dsi2—indireet disp =O ROAR 


The seven LSBs of the count operand register are used to generate the 2s- 
complement shift count of up to 32 bits. 


If the count operand is greater than 0, the src2 operand is left shifted by the 
value of the count operand. Low-order bits shifted in are zero filled, and high- 
order bits are shifted out through the C bit. 


Arithmetic left shift: 
Ce sre2<0 


If the count operand is less than 0, the src2 operand is right hifted by the abso- 
lute value of the count operand. The high-order bits of the src2 operand are 
sign extended as they are right shifted. Low-order bits are shifted out through 
the C bit. 


Cycles 


Status Bits 


Mode Bit 


Parallel ASH3 and ST|_ ASH3||STI 


Arithmetic right shift: 


sign of src2 > src2 > C 


If the count operand is 0, no shift is performed, and the C bit is set to 0. The 
count and dst operands are assumed to be signed integers. 


All registers are read at the beginning and loaded at the end of the execute 
cycle. If one of the parallel operations (STI) reads from a register and the oper- 
ation being performed in parallel (ASH3) writes to the same register, STI ac- 
cepts the contents of the register as input before it is modified by the ASH3. 


If src2 and dsit2 point to the same location, src2 is read before the write to dst2. 
1 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 

UF 0 

N MSB of the output 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc Set to the value of the last bit shifted out; 0 for a shift count of 0 


OVM Operation is not affected by OVM bit value. 
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ASH3||STI Paralle! ASH3 and STI 


Example ASH3 R1, *AR6++(IR1) ,RO 
| | STI R5,*AR2 
Before Instruction After Instruction 
RO 00 0000 0000 RO 00 FFFF FFAE 
R1 00 0000 FFE8| -24 R1 00 0000 FFE8] -24 
R5 00 0000 0035} 53 R5 00 0000 0035] 53 
AR2 80 98A2 AR2 80 98A2 
AR6 80 9900 AR6 80 998C 
IR1 8 
LUF LUF 
LV LV 
UF UF 


QO < N Z 


UU 


Cc 

Data memory 
809900h 0AE000000 809900h OAE000000 

8098A2h 8098A2h 3 53 


W QUO 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Branch Conditionally (Standard) Bcond 


Bcond src 


If cond is true: 
If src is in register-addressing mode (Rn, 0 < n<27), 
src + PC. 
If src is in PC-relative mode (label or address), 
displacement + PC + 1 > PC. 
Else, continue 


src conditional-branch addressing modes (B): 


0 register 
1 PC relative 


31 15 8 7 


24 23 16 0 
0.1.1 0.4 SEE 0 0 0} cond | Register or displacement 


Bcondsignifies a standard branch that executes in four cycles. A branch is per- 
formed if the condition is true (since a pipeline flush also occurs on a true condi- 
tion; see Section 8.2, Pipeline Conflicts, on page 8-4). If the src operand is ex- 
pressed in register addressing mode, the contents of the specified register are 
loaded into the PC. If the src operand is expressed in PC-relative mode, the 
assembler generates a displacement: displacement = label — (PC of branch 
instruction + 1). This displacement is stored as a 16-bit signed integer in the 
16 LSBs of the branch instruction word. This displacement is added to the PC 
of the branch instruction plus 1 to generate the new PC. 


The ’C8x provides 20 condition codes that you can use with this instruction 
(see Table 13-12 on page 13-30 for a list of condition mnemonics, condition 
codes and flags). Condition flags are set on a previous instruction only when 
the destination register is one of the extended-precision registers (R7—RO) or 
when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, 
or TSTB3) is executed. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-81 


Bcond = Branch Conditionally (Standard) 


Example BZ RO 
Before Instruction After Instruction 
RO RO 
PC PC 
LuF[__ LurF [Oo 
vE sid’ A | 
UPL uF [oo] 
NL N Lol 
Z z 
vo v Ld 
cL c Lo 

Note: 


If a BZ instruction is executed immediately following a RND instruction with 
a 0 operand, the branch is not performed, because the 0 flag is not set. To 
circumvent this problem, execute a BZUF instead of a BZ instruction. 
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Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Branch Conditionally (Delayed) BcondD 


BcondD src 


If cond is true: 
If src is in register-addressing mode (Rn, 0 < n< 27), 
src > PC. 
If src is in PC-relative mode (label or address), 
displacement + PC + 3 — PC. 
Else, continue 


src conditional-branch addressing modes (B): 


0 register 
1 PC relative 


31 24 23 1615 8 7 0 


011010 plo 0 0|1| cond | Register or displacement 


BcondD signifies a delayed branch that allows the three instructions after the 
delayed branch to be fetched before the PC is modified. The effect is a single- 
cycle branch, and the three instructions following BcondD do not affect the 
condition. 


A branch is performed if the condition is true. If the src operand is expressed 
in register-addressing mode, the contents of the specified register are loaded 
into the PC. If the src operand is expressed in PC-relative mode, the assembler 
generates a displacement: displacement = label — (PC of branch instruction 
+ 3). This displacement is stored as a 16-bit signed integer in the 16 LSBs of 
the branch instruction. This displacement is added to the PC of the branch in- 
struction plus 3 to generate the new PC. The ’C3x provides 20 condition codes 
that you can use with this instruction (see Table 13-12 on page 13-30 for a list 
of condition mnemonics, condition codes, and flags). Condition flags are set 
on the previous instruction only when the destination register is one of the ex- 
tended-precision registers (R7—RO) or when one of the compare instructions 
(CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) is executed. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 
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BcondD Branch Conditionally (Delayed) 


Example BNZD 36 (36 = 24h) 

Before Instruction After Instruction 

PC PC 

uF Si tur [dd 

i || vw [oo 

uF[ UP [ 
NL N [OO 
zd zC 4 
vo" 7 
clo] ¢ Lal 

Note: 


Delayed branches disable interrupts until the completion of the three instruc- 
tions that follow the delayed branch, regardless if the branch is or is not per- 
formed. The following instructions cannot be used in the next three instruc- 
tions following a delayed branch: Bcond, BconaD, BR, BRD, CALL, CALL- 
cond, DBcond, DBconaD, IDLE, IDLE2, RETlcond, RETScond, RPTB, 
RPTS, TRAPcond. 


ss | 
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Syntax 
Operation 
Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


Branch Unconditionally (Standard) BR 


BR src 
src — PC 


src long-immediate addressing mode 


31 24 23 1615 8 7 0 


01 10000)0| src 


BR performs a PC-relative branch that executes in four cycles, since a pipeline 
flush also occurs upon execution of the branch (see Section 8.2, Pipeline Con- 
flicts, on page 8-4). An unconditional branch is performed. The src operand is 
assumed to be a 24-bit unsigned integer. Note that bit 24 = 0 for a standard 
branch. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


BR 805Ch 

Before Instruction After Instruction 

PC PC 

LuF[_ uF [ 
wl vw fd 

U[_ uP [ 
NL NL 

z— dd zf 
vo 2 
cfd os | 
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BRD Branch Unconditionally (Delayed) 


Syntax 
Operation 
Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


13-86 


BRD src 
src > PC 


src long-immediate addressing mode 


31 24 23 16 15 


01 10000)1| src 


BRD signifies a delayed branch that allows the three instructions after the 
delayed branch to be fetched before the PC is modified. The effect is a 


single-cycle branch. 


An unconditional branch is performed. The src operand is assumed to be a 


24-bit unsigned integer. Note that bit 24 = 1 for a delayed branch. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 


N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 


BRD 2Ch 
Before Instruction 
PC PC 
LuF[_ LUF 
wld LV 
UPL UF 
NC N 
zt Z 
v7 v 
co” c 


After Instruction 


002C 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


Call Subroutine CALL 


CALL src 


Next PC —> *+4+SP 
src + PC 


src long-immediate addressing mode 


31 24 23 1615 8 7 0 


01100040 src 


A call is performed. The next PC value is pushed onto the system stack. The 
src operand is loaded into the PC. The src operand is assumed to be a 24-bit 
unsigned-immediate operand. Since the CALL instruction takes 4 cycles to 
execute, the pipeline is flushed. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


CALL 123456h 


Before Instruction After Instruction 
PC PC 
SP SP 
uF[ wwF Lo 
ve) wv Lo 
UPL] UP [_ 
NL N [ol 
z[ zLd 
v~E | 
cL cL 
Data memory 
gogso2zh [| 


Assembly Language Instructions 13-87 


CALLcond 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


13-88 


Call Subroutine Conditionally 


CALLcond src 


If cond is true: 
Next PC — *++SP 
If src is in register addressing mode (Rn, 0 <n<27), 
src + PC. 
If src is in PC-relative mode (label or address), 
displacement + PC + 1 — PC. 


Else, continue 
src conditional-branch addressing modes (B): 


0 register 
1 PC relative 


15 8 7 0 


31 24 23 16 
011410 08/0 00 0) cond Register or displacement 


A call is performed if the condition is true. If the condition is true, the next PC 
value is pushed onto the system stack. If the src operand is expressed in regis- 
ter addressing mode, the contents of the specified register are loaded into the 
PC. If the src operand is expressed in PC-relative mode, the assembler gener- 
ates a displacement: displacement = label — (PC of call instruction + 1). This 
displacement is stored as a 16-bit signed integer in the 16 LSBs of the call in- 
struction word. This displacement is added to the PC of the call instruction plus 
1 to generate the new PC. This instruction flushes the pipeline as shown in 
Example 8-13 on page 8-18. 


The ’C8x provides 20 condition codes that can be used with this instruction 
(see Table 13-12 on page 13-30 for a list of condition mnemonics, condition 
codes, and flags). Condition flags are set on a previous instruction only when 
the destination register is one of the extended-precision registers (R7—RO) or 
when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI8, TSTB, 
or TSTB3) is executed. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 


N Unaffected 
Z Unaffected 
Vv Unaffected 
Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 


Example CALLNZ R5 


Before Instruction 

R5 
PC 
SP 
urF[ 


cL 


Data memory 


Assembly Language Instructions 


Call Subroutine Conditionally CALLcond 


After Instruction 


R5 
PC 
SP 
LUF 
LV 
UF 


QO< N Z 


HN 


809836h 124 


13-89 


CMPF Compare Floating-Point Value 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-90 


CMPF src, dst 
dst - src 
src general addressing modes (G): 


00 register (Rn,O<n<7) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 <n<7) 


31 2423 1615 iS. 7 0 


The src operand is subtracted from the dst operand. The result is not loaded 
into any register, which allows for nondestructive compares. The dst and src 
operands are assumed to be floating-point numbers. 


{ 


These condition flags are modified for all destination registers (R27-RO). 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if a floating-point overflow occurs; 0 otherwise 


Cc Unaffected 
OVM Operation is not affected by OVM bit value. 


Compare Floating-Point Value CMPF 


Example CMPF *+AR4,R6 

Before Instruction After Instruction 

R6 1.4050e+02 R6 1.4050e+02 
AR4 AR4 
LuF{[_ LuF[_ 
wl wl] 
UPL uUF[_ 
Cd NC 9] 
2. z 
vL_ v[E 
cL cL] 


Data memory 


8098F3h 1.4050e+02  8098F3h 1.4050e+02 


Assembly Language Instructions 13-91 


CMPF3 Compare Floating-Point Value, 3-Operand 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-92 


CMPF3 src2, src1 


sre? — srce2 


src? 3-operand addressing modes (T): 


register (Rn1, 0 < n1 <7) 
indirect (disp = 0, 1, IRO, IR1) 
register (Rn1, 0 < n1 <7) 
indirect (disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (T): 


31 


{ 


These condition flags are modified for all destination registers (R27-RO). 


LUF 
LV 


UF 
N 
Z 
Vv 
Cc 


OVM 


a ee eee ees 


se 


register 
register 
indirect 
indirect 


Rn2, 0 < n2 <7) 
Rn2, 0 < n2<7) 
disp = 0, 1, IRO, IR1) 
disp = 0, 1, IRO, IR1) 


a~ a ~~ 


15 


24 23 16 8 7 0 


The src2 operand is subtracted from the src? operand. The result is not loaded 
into any register, which allows for nondestructive compares. The src7 and src2 
operands are assumed to be floating-point numbers. Although this instruction 
has only two operands, it is designated as a 3-operand instruction because op- 
erands are specified in the 3-operand format. 


if a floating-point underflow occurs; unchanged otherwise 
if a floating-point overflow occurs; unchanged otherwise 
if a floating-point underflow occurs; 0 otherwise 

if a negative result is generated; 0 otherwise 

if a O result is generated; 0 otherwise 

if a floating-point overflow occurs; 0 otherwise 


Unaffected 


Operation is not affected by OVM bit value. 


Compare Floating-Point Value, 3-Operand CMPF3 


Example CMPF3 *AR2, *AR3--(1) 
Before Instruction . After Instruction 
AR2 AR2 
AR3 AR4 
We, uF [J 
wf] vw Ld 
uF [Ot] uF [dl 
nC 9] N 
zC 9] zd 
vo] v Ld 
cL] c [Ld 
Data memory 
809831h [77a7000] 2.5044e+02 809831h L77a7000) 2.5044e+02 
809852h [57a2000]6.253125e+01 809852h [57a2000]6.253125¢+01 
Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


Assembly Language Instructions 


13-93 


CMPI Compare Integer 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


13-94 


CMPI src, dst 
dst - src 
src general addressing modes (G): 


00 register (Rn, 0 < n< 27) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 < n< 27) 


31 24 23 1615 8 7 0 


The src operand is subtracted from the dst operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The dst and src 
operands are assumed to be signed integers. 


1 
These condition flags are modified for all destination registers (R27-RO). 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a borrow occurs; 0 otherwise 


OVM Operation is not affected by OVM bit value. 


CMPI R3,R7 

Before Instruction After Instruction 

R3 2200 R3_ [_00 0000 0898] 2200 

R7 1000 R7 1000 
LuF[ uF Ld 
vl vf 
UPL__ uF [ 
N[ N 
zd zC__4 
vd vod 
cL C 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


CMPI3 


Compare Integer, 3-Operand CMPI3 


srce2, src1 


sre? — src2 


src? 3-operand addressing modes (T): 


register 
indirect 
register 
indirect 


Rni,0< ni <27) 
disp = 0, 1, IRO, IR1) 
Rni,0< ni <27) 
disp = 0, 1, IRO, IR1) 


~~ 


src2 3-operand addressing modes (T): 


00 
01 


31 


register (Rn2, 0 < n2 < 27) 
register (Rn2, 0 < n2 < 27) 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, IRO, IR1) 


15 


24 23 16 8 7 0 


The src2 operand is subtracted from the src7 operand. The result is not loaded 
into any register, which allows for nondestructive compares. The src7 and src2 
operands are assumed to be signed integers. Although this instruction has 
only two operands, it is designated as a 3-operand instruction because oper- 
ands are specified in the 3-operand format. 


1 


These condition flags are modified for all destination registers (R27-RO). 


LUF 
LV 
UF 
N 

Z 

Vv 

Cc 


OVM 


Unaffected 

1 if an integer overflow occurs; unchanged otherwise 
0 

1 if a negative result is generated; 0 otherwise 

1 if a O result is generated; 0 otherwise 

1 if an integer overflow occurs; 0 otherwise 

1 if a borrow occurs; 0 otherwise 


Operation is not affected by OVM bit value. 


Assembly Language Instructions 


13-95 


CMPI3 Compare Integer, 3-Operand 


Example CMPI3 R7,R4 

Before Instruction After Instruction 

R4 2200 R4 [00 0000 0898] 2200 

R7 1000 R7 1000 
LuF[ uF [_d 
vl vf 
uFL___ uF [ 
nC qd C4 
zL_ z{ 
v~E 2) es | 
cL cL 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


13-96 


Syntax 


Operation 


Operands 


Opcode 


Description 


Decrement and Branch Conditionally (Standard) DBcond 


DBcond ARn, src 


ARn-1—ARn 
If cond is true and ARn=0: 
If src is in register addressing mode (Rn, 0< n< 27), 
src > PC. 
If src is in PC-relative mode (label or address), 
displacement + PC + 1 > PC. 
Else, continue 


src conditional-branch addressing modes (B): 


0 register 
1 PC relative 


ARn register (0 < n< 7) 


31 15 8 7 0 


24 23 16 
0141041 to] arn [0] cond | Register or displacement 


DBcond signifies a standard branch that executes in four cycles because the 
pipeline must be flushed if cond is true. The specified auxiliary register is de- 
cremented and a branch is performed if the condition is true and the specified 
auxiliary register is greater than or equal to 0. The condition flags are those set 
by the last previous instruction that affects the status bits. 


The auxiliary register is treated as a 24-bit signed integer. The 8 MSBs are un- 
modified by the decrement operation. The comparison of the auxiliary register 
uses only the 24 LSBs of the auxiliary register. Note that the branch condition 
does not depend on the auxiliary register decrement. 


If the src operand is expressed in register addressing mode, the contents of 
the specified register are loaded into the PC. If the src operand is expressed 
in PC-relative addressing mode, the assembler generates a displacement: 
displacement = label — (PC of branch instruction + 1). This integer is stored as 
a 16-bit signed integer in the 16 LSBs of the branch instruction word. This dis- 
placement is added to the PC of the branch instruction plus 1 to generate the 
new PC. 


The ’C8x provides 20 condition codes that can be used with this instruction 
(see Table 13-12 on page 13-30 for a list of condition mnemonics, condition 
codes, and flags). Condition flags are set on a previous instruction only when 
the destination register is one of the extended-precision registers (RO—R7) or 
when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, 
or TSTB3) is executed. 


Assembly Language Instructions 13-97 


DBcond Decrement and Branch Conditionally (Standard) 


Cycles 


Status Bits 


Mode Bit 


Example 


13-98 


Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 


Operation is not affected by OVM bit value. 


200,R3 
AR3,R2 


Before Instruction 


R2 00 0000 009F 
R3 00 0000 0080 
AR3 00 0012 


R2 
R3 
AR3 
PC 
LUF 
LV 
UF 


QO< N Z 


After Instruction 
00 0000 009F 
00 0000 0080 

00 0011 


OO9F 


IVIL 


Syntax 


Operation 


Operands 


Opcode 


Description 


Decrement and Branch Conditionally (Delayed) DBcondD 


DBcondD AR», src 


ARn-1—ARn 
If cond is true and ARn = 0: 
If src is in register addressing mode (Rn, 0 < n< 27) 
src > PC 
If src is in PC-relative mode (label or address) 
displacement + PC + 3 > PC. 


Else, continue 
src conditional-branch addressing modes (B): 


0 register 
1 PC relative 


ARn register (0 < n< 7) 


15 


31 24 23 16 8 7 0 
011041 ta] arn |1] cond | Register or displacement 


DBcond D signifies a delayed branch that allows the three instructions after 
the delayed branch to be fetched before the PC is modified. The effect is a 
single-cycle branch. The specified auxiliary register is decremented, and a 
branch is performed if the condition is true and the specified auxiliary register 
is greater than or equal to 0. The condition flags are those set by the last pre- 
vious instruction that affects the status bits. The three instructions following the 
DBcondD do not affect the condition. 


The auxiliary register is treated as a 24-bit signed integer. The 8 MSBs are un- 
modified by the decrement operation. The comparison of the auxiliary register 
uses only the 24 LSBs of the auxiliary register. The branch condition does not 
depend on the auxiliary register decrement. 


If the src operand is expressed in register-addressing mode, the contents of 
the specified register are loaded into the PC. If the src is expressed in PC-rela- 
tive addressing, the assembler generates a displacement: displacement = la- 
bel — (PC of branch instruction + 3). This displacement is added to the PC of 
the branch instruction plus 3 to generate the new PC. Note that bit 21 = 1 for 
a delayed branch. 


The ’C8x provides 20 condition codes that you can use with this instruction 
(see Table 13-12 on page 13-30 for a list of condition mnemonics, condition 
codes, and flags). Condition flags are set on a previous instruction only when 
the destination register is one of the extended-precision registers (R7—RO) or 
when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, 
or TSTB3) is executed. 


Assembly Language Instructions 13-99 


DBcondD 


Cycles 


Status Bits 


Mode Bit 


Example 


13-100 


Decrement and Branch Conditionally (Delayed) 


Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 
Unaffected 


Operation is not affected by OVM bit value. 


26h,R2 
AR5, $+110h 


Before Instruction 


R2 00 0000 0026 
AR5 00 0067 


R2 
ARS 
PC 
LUF 
LV 
UF 


QO < N Z 


After Instruction 
00 0000 0026 
00 0066 


0210 


INIM 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Floating-Point-to-Integer Conversion FIX 


FIX src, dst 
fix(src) > dst 
src general addressing modes (G): 


00 register (Rn, 0 <n<7) 

01 ~ direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst any CPU register 


31 24 23 1615 87 0 


The floating-point operand src is rounded down to the nearest integral value 
less than or equal to floating-point value, and the result is loaded into the dst 
register. The src operand is assumed to be a floating-point number and the dst 
operand a signed integer. 


The exponent field of the dst register (bits 39-32) is not modified. 


Integer overflow occurs when the floating-point number is too large to be repre- 
sented as a 32-bit 2s-complement integer. In the case of integer overflow, the 
result is saturated in the direction of overflow. 


1 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-101 


FIX = Floating-Point-to-Integer Conversion 


Example FIX R1,R2 


R1 
R2 
LUF 
LV 
UF 


Oo = nN 2 


13-102 


Before Instruction 


OA 2820 0000) 1.3454e+3 
00 0000 0000 


[Ld 
[Ld 
[Ll 
[Ld 
CL 
eT) 
[Ld 


R1 
R2 
LUF 
LV 
UF 


QO<c N Z 


After Instruction 


OA 2820 0000] 13454e+3 
00 0000 0541] 1345 


[ 
[ 
[Ld 
[Ld 
Ld 
[Ld 
[ 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Parallel FIX and ST! FIX||STI 


FIX src2, dst? 
|| STI src3, dsi2 


fix(src2) — dst? 
\| src3 > dst2 


src2__ indirect 
dst? register 
src3_ register 
dst2 indirect 


— 


disp = 0, 1, IRO, IR1) 
Rni,0<n1 <7) 
Rn2, 0 < n2<7) 
disp = 0, 1, IRO, IR1) 


This instruction’s operands have been augmented in the following devices: 


J ’C31 silicon revision 6.0 or greater 
J ’C2 silicon revision 2.0 or greater 
L) ’VC33 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn1i,0<n1 <7) 
src3_ register (Rn2, 0 < n2 <7) 

indi isp = 8, + ROAR 


31 24 23 1615 8 7 0 
01010 “as [0.0 0 src3 dsi2 src2 


A floating-point-to-integer conversion is performed. All registers are read at 
the beginning and loaded at the end of the execute cycle. This means that, if 
one of the parallel operations (STI) reads from a register, and the operation 
being performed in parallel (FIX) writes to the same register, STI accepts the 
contents of the register as input before it is modified by FIX. 


If src2 and dst2 point to the same location, src2 is read before the write to dst2. 


Integer overflow occurs when the floating-point number is too large to be rep- 
resented as a 32-bit 2s-complement integer. In the case of integer overflow, 
the result is saturated in the direction of overflow. 


1 


Assembly Language Instructions 13-103 


FIX||STI = Parallel! FIX and STI 


Status Bits These condition flags are modified only if the destination register is R7—RO. 
LUF Unaffected 
LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 
N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 
Vv 1 if an integer overflow occurs; 0 otherwise 
Cc Unaffected 
Mode Bit OVM Operation is not affected by OVM bit value. 
Example FIX *++AR4 (1) ,R1 
| | STI RO, *AR2 
Before Instruction After Instruction 
RO 220 RO 220 
Rt Rt 179 
AR2 AR2 
AR4 AR4 
LuF[__ wr [ 
v4] w (7 
ue 2 er) 
NC] NL 
Z[ z[ 
ie v[~ 
cL cL] 
Data memory 
8098A3h 1.7950e+02 8098A$ 733000] 1.79750e+02 
sosssch [__ goss3¢__ oc] 220 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


13-104 


Integer-to-Floating-Point Conversion FLOAT 


Syntax FLOAT src, dst 
Operation float (src) > dst 
Operands src general addressing modes (G): 


00 register (Rn, 0 < n< 27) 

01 direct 

10 — indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 < n<7) 


Opcode 
31 24 23 1615 87 0 
Description The integer operand src is converted to the floating-point value equal to it; the 
result is loaded into the dst register. The src operand is assumed to be a signed 
integer; the dst operand is assumed to be a floating-point number. 
Cycles 1 
Status Bits These condition flags are modified only if the destination register is R7-RO. 
LUF Unaffected 
LV Unaffected 
UF 0 
N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 
Vv 0 
Cc Unaffected 
Mode Bit OVM = Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-105 


FLOAT _Integer-to-Floating-Point Conversion 


Example FLOAT *++AR2(2),R5 


Cc 
Data memory 
809802 


13-106 


Before Instruction 


00 034C 2000]1.27578125e+01 R5 


AR2 
| LUF 
[Lo LV 
[ UF 
[ N 
[Lo Zz 
[Ll V 
Lo] C 
174 809802 


After Instruction 


00 72E0 0000}]1.74e+02 
80 9802 


[ 
Ld 
[ 
[ 
[ 
[ 
[ 


174 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Parallel FLOAT and STF FLOAT||STF 


FLOAT src2, dst? 
|| STF src3, dst2 


float(src2 ) > dst1 
|| src3— dst2 


src2__ indirect 
dst? register 
src3_ register 
dst2 register 


— 


disp = 0, 1, IRO, IR1) 
Ri, 0<n1 <7) 
Rn2, 0 < n2<7) 
disp = 0, 1, IRO, IR1) 


~~ 


This instruction’s operands have been augmented in the following devices: 


J ’C31 silicon revision 6.0 or greater 
J ’C32 silicon revision 2.0 or greater 
L) ’VC33 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn1i,O0<n1 <7) 
src3_ register (Rn2, 0 < n2 <7) 


dsi2—registerdisp—0t 4REIRY 


3 24 23 1615 8 7 0 


1 
0101 ‘| dst [oo 0 src3 dsi2 src2 


An integer-to-floating-point conversion is performed. All registers are read at 
the beginning and loaded at the end of the execute cycle. If one of the parallel 
operations (STF) reads from a register and the operation being performed in 
parallel (FLOAT) writes to the same register, then STF accepts the contents 
of the register as input before it is modified by FLOAT. 


If src2 and dst2 point to the same location, src2 is read before the write to dst2. 
1 
These condition flags are modified only if the destination register is R7—RO0. 


LUF Unaffected 

LV Unaffected 

UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 
Vv 0 
Cc Unaffected 


OVM Operation is affected by OVM bit value. 


Assembly Language Instructions 13-107 


FLOAT||STF Parallel! FLOAT and STF 


Example FLOAT *+AR2 (IRO) ,R6 
| | STF R7,*ARL 
Before Instruction After Instruction 
R6 R6 [_07 2E00 0000] 1.740e+02 
R7 [03 4c20 0000] 1.27578125e+01 R7 [03 4¢20 0000) 1.27578125e+01 
ARI ARI 
AR2 AR2 
IRO IRO 
wr [_ uF [_ 
w [9] v Cg 
uF [od uF [ 
NL] NL 
zL_ zL__ dd 
v CL v Ld 
cL] cL 
Data memory 
8098CD 174 gogscD [—saoaz} 74 
go9933 [| ——S—Ci 809933 [__034¢2000] 1.27578125e+01 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


13-108 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Interrupt Acknowledge JACK 


IACK src 


Perform a dummy read operation with IACK = 0. 
At end of dummy read, set IACK to 1. 


src general addressing modes (G): 


01 direct 
10 ~ indirect 
31 2423 1615 87 0 
1140419 Glooo00 src 


A dummy read operation is performed if off-chip memory is specified. IACK is 
set to 0, regardless of src location, a half H1 cycle after the beginning of the 
decode phase of the IACK instruction. At the first half of the H1 cycle of the 
completion of the dummy read, IACK is set to 1. The IACK signal will not be 
extended due to multicycle reads with wait states. This instruction can be used 
to generate an external interrupt acknowledge. The IACK signal and the ad- 
dress can be used to signal interrupt acknowledge to external devices. The 
data read by the processor is unused. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-109 


IACK Interrupt Acknowledge 


Example IACK *AR5 

Before Instruction After Instruction 

TACK TACK 
PC PC 301 
LuF[ uF Ld 
vl vf 
uFL___ uF [ 
nC qd er 
zL_ z{ 

v[E 2 | 
cL cL 


13-110 


Syntax 


Operation 


Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


Idle Until Interrupt IDLE 


IDLE 


1 > ST(GIE) 
Next PC — PC 
Idle until interrupt. 


None 


31 2423 1615 87 0 
jo00 001100;/00000000000000000000000 


The global-interrupt-enable bit is set, the next PC value is loaded into the PC, 
and the CPU idles until an unmasked interrupt is received. When the interrupt 
is received, the contents of the PC are pushed onto the active system stack, 
the interrupt vector is read, and the interrupt service routine is executed. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 


IDLE ; The processor idles until a reset 
; or unmasked interrupt occurs. 


For correct device operation, the three instructions after a delayed 
branch should not be IDLE or IDLE2 instructions. 


Assembly Language Instructions 13-111 


IDLE2 Low-Power Idle 


Syntax 


Operation 


Operands 


Opcode 


Description 


13-112 


IDLE2 (supported by: ’LC31, ’C32, ’C30 silicon revision 7.x or 
greater, C31 silicon revision 5.x or greater) 

1 > ST(GIE) 

Next PC + PC 


Idle until interrupt. 


None 


31 


2423 1615 87 0 


foo 0f001100/00000000000000000000001 


The IDLE2 instruction serves the same function as IDLE, except that it re- 
moves the functional clock input from the internal device. This allows for ex- 
tremely low power mode. The PC is incremented once, and the device remains 
in an idle state until one of the external interrupts (INTO—3) is asserted. 


In IDLE2 mode, the ’C3x devices that support this mode behave as follows: 


L 


L) 


The CPU, peripherals, and memory retain their previous states. 


When the device is in the functional (nonemulation) mode, the clocks stop 
with H1 high and H3 low. 


The device remains in IDLE2 until one of the four external interrupts 
(INT3-INTO) is asserted for at least two H1 cycles. When one of the four 
interrupts is asserted, the clocks start after a delay of one H1 cycle. The 
clocks can start up in the phase opposite that in which they were stopped 
(that is, H1 might start high when H3 was high before stopping, and H3 
might start high when H1 was high before stopping). However, the H1 and 
H3 clocks remain 180° out of phase with each other. 


During IDLE2 operation, one of the four external interrupts must be as- 
serted for at least two H2 cycles to be recognized and serviced by the 
CPU. For the processor to recognize only one interrupt when it restarts op- 
eration, the interrupt must be asserted for less than three cycles. 


When the device is in emulation mode, the H1 and H3 clocks continue to 
run normally, and the CPU operates as if an IDLE instruction had been 
executed. The clocks continue to run for correct operation of the emulator. 


Low-Power Idle IDLE2 


For correct device operation, the three instructions after a delayed 
branch should not be IDLE or IDLE2 instructions. 


Cycles 1 
Status Bits LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
Vv Unaffected 
Cc Unaffected 
Mode Bit OVM Operation is not affected by OVM bit value. 
Example IDLE2 ; The processor idles until a reset 


; or interrupt occurs. 


Assembly Language Instructions 13-113 


LDE Load Floating-Point Exponent 


Syntax LDE src, dst 
Operation src(exp) — dst(exp) 
Operands src general addressing modes (G): 


00 register (Rn, O<n<7) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 <n<7) 


Opcode 
31 2423 1615 87 ) 
poe] = | 
Description The exponent field of the src operand is loaded into the exponent field of the 
dst register. No modification of the dst register mantissa field is made unless 
the value of the exponent loaded is the reserved value of the exponent for 0 
as determined by the precision of the src operand. Then the mantissa field of 
the dst register is set to 0. The src and dst operands are assumed to be float- 
ing-point numbers. Immediate values are evaluated in the short floating-point 
format. 
Cycles 1 
Status Bits LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
Vv Unaffected 
Cc Unaffected 
Mode Bit OVM Operation is not affected by OVM bit value. 


13-114 


Example 


LDE RO,R5 


Before Instruction 


RO 02 0005 6F30)4.00066337e+00 
R5 OA 056F E332]1.06749648e+03 


Load Floating-Point Exponent 


After Instruction 


LDE 


RO 02 0005 6F30)/4.00066337e+00 
R5 02 056F E332)/4.16990814e+00 


lurL__ 
wl" 
urL__ 
NC 
zL 
vL__ 
cL 


Assembly Language Instructions 


13-115 


LDF Load Floating-Point Value 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


Data memory 


13-116 


809800 


LDF src, dst 
src > dst 
src general addressing modes (G): 


00 register (Rn,O<n<7) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 <n<7) 


31 2423 1615 87 0 
eodorrde a : 
The src operand is loaded into the dst register. The dst and src operands are 
assumed to be floating-point numbers. 

1 


These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 

Vv 0 

Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 
LDF @9800h,R2 


Before Instruction After Instruction 

R2 [01 0¢52 aooo] 2.19254303e+00 
080 DP 080 

[ wr [_ 

[ vl 

[ uF [| 

[Ld NL 

Ld z{[_ 

[ vob 

Ld cL 


010C52A0|2.19254303e+00 809800 010C52A0| 2.19254303e+00 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Load Floating-Point Value Conditionally LDFeond 


LDFcond src, dst 


If cond is true: 
src — dst. 


Else: 
dst is unchanged. 


src general addressing modes (G): 


00 register (Rn, O<n<7) 

01 direct 

10 indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 <n<7) 


31 2423 1615 87 0 


If the condition is true, the src operand is loaded into the dst register; otherwise, 
the dst register is unchanged. The dst and src operands are assumed to be 
floating-point numbers. 


The 'C3x provides 20 condition codes that can be used with this instruction 
(see Table 13-12 on page 13-30 for a list of condition mnemonics, condition 
codes, and flags). Note that an LDFU (load floating-point unconditionally) in- 
struction is useful for loading R7—RO without affecting condition flags. Condi- 
tion flags are set on a previous instruction only when the destination register 
is one of the extended-precision registers (R7—RO) or when one of the com- 
pare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) is ex- 
ecuted. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-117 


LDFcond Load Floating-Point Value Conditionally 


Example LDFZ 


R3 2C FF2C D500|1.77055560e+13 


R3,R5 


Before Instruction 


R3 


After Instruction 


2C FF2C D500] 1.77055560e+13 


R5 5F 0000 003E}3.96140824e+28 R5 2C FF2C D500} 1.77055560e+13 


LUF 
LV 
UF 


oO =< Nn 2 


13-118 


[ 


Ll 
eT) 
Ll 
[ 
[ol 


LUF 
LV 


Ll 


——aeet 
Ll 
Ll 
[ol 
Ll 


Load Floating-Point Value, Interlocked LDFI 


Syntax LDFI src, dst 
Operation Signal interlocked operation 
src > dst 
Operands src general addressing modes (G): 
01 direct 


10 — indirect (disp = 0-255, IRO, IR1) 
dst register (Rn, 0 <n<7) 


Opcode 
31 1615 87 


2423 0 


Description The src operand is loaded into the dst register. An interlocked operation is sig- 
naled over XFO and XF1. The src and dst operands are assumed to be floating- 
point numbers. Only direct and indirect modes are allowed. See Section 7.4, 
Interlocked Operations, on page 7-13 for a detailed description. 


Cycles 1 if XF1 = 0 (see Section 7.4 on page 7-13) 
Status Bits These condition flags are modified only if the destination register is R7-RO. 


LUF Unaffected 

LV Unaffected 

0 

1 if a negative result is generated; 0 otherwise 
1 if a O result is generated; 0 otherwise 

0 

Unaffected 


Mode Bit OVM Operation is not affected by OVM bit value. 


QO<N2ZC 
Fi 


Assembly Language Instructions 13-119 


LDFI Load Floating-Point Value, Interlocked 


Example LDFI *+AR2,R7 
Before Instruction After Instruction 
R7 R7 -6.28125e+01 
AR2 AR2 
LuF[_ wuF Lo 
wl wf] 
UPL UF Lo] 
N{ | N [Lo] 
z{_ z Li] 
vL_ ol v Lo 
cL] c Lo] 


Data memory 


8098F2h [__584c000] -6.28125¢+01 8098F2h -6.28125e+01 


13-120 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Parallel LDF and LDF + LDF||LDF 


LDF src2, dst2 
|| LDF src7, dst? 


src2 > dst2 
|| src1 — dst1 


src? indirect 
dst? register 
src2__ indirect 
dst2 register 


— 


disp = 0, 1, IRO, IR1) 
Rni,0<n1 <7) 
disp = 0, 1, IRO, IR1) 
Rn2, 0 < n2<7) 


~~ O N 


This instruction’s operands have been augmented on the following devices: 


J ’C31 silicon revision 6.0 or greater 
J ’C2 silicon revision 2.0 or greater 
L) *VC33 


src? indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn1i,0<n1 <7) 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dsi2—register-{Rn2,0-<n2<F} 


31 2423 1615 87 0 
0 


Two floating-point loads are performed in parallel. If the LDFs load the same 
register, the assembler issues a warning. The result is that of LDF src2, dsi2. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-121 


LDF||LDF Parallel LDF and LDF 


Example LDF *--AR1(IRO) ,R7 
| | LDF *AR7++ (1) ,R3 
Before Instruction After Instruction 
R3 RO 
R7 R3 6 .281250e+01 
ARI R7 1.4050e+02 
AR7 ARI 
IRO AR7 
LuF[ lF[ 
vd wd 
uF[_ uF[___ 
NL NL 
zl zL_ 
v[_ vL_ 
cL cL 


Data memory 


809857h 1.4050e+02  809857h 1.4050e+02 
80988Ah 6.281250e+01 80988Ah 6.281250e+01 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


13-122 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Parallel LDF and STF_ LDF||STF 


LDF src2, dst? 
|| STF src3, dst2 


src2 — dst1 
\| src3 > dst2 


src2__ indirect 
dst? register 
src3_ register 
dst2 indirect 


disp = 0, 1, IRO, IR1) 
Ri, 0<n1 <7) 
Rn2, 0 < n2<7) 
disp = 0, 1, IRO, IR1) 


~~ a a ~ 


src2 
dst? 
src3 


This instruction’s operands have been augmented on the following devices: 


J ’C31 silicon revision 6.0 or greater 
J ’C2 silicon revision 2.0 or greater 
LJ ’VC33 


indirect (disp = 0, 1, IRO, IR1) or any CPU register 
register (Rn1, 0 < n1 <7) 
register (Rn2, 0 < n2 <7) 


adsi2 


31 


indirect (disp —=6,+ IREARH 


2423 1615 87 


0 


0110 0} ast: |0 0 o| src3 dsi2 src2 


A floating-point load and a floating-point store are performed in parallel. 


If src2 and dsi2 point to the same location, src2 is read before the write to dst2. 


LUF Unaffected 
LV Unaffected 
Unaffected 
Unaffected 


Unaffected 
Unaffected 


OVM Operation is not affected by OVM bit value. 


UF 

N 

Zz Unaffected 
V 

Cc 


Assembly Language Instructions 


13-123 


LDF||STF Parallel LDF and STF 


Example LDF *AR2--(1),R1 
| | STF R3,*AR4++ (IR1) 

Before Instruction After Instruction 

Rt R1 1.4050e+02 

R3 6.28125e+01 R3 6.28125e+01 
AR2 AR2 
AR4 AR4 
IR It 
WWF [_ luF[ 
wl] Ye] 
uF [| UFL__ 
NL] Xr 
zL_ zL_ 
v{ vL_ 
cL] c{_ 


Data memory 


8098E7h 70C8000]1.4050e+02  8098E7h 1.4050e+02 
809900h 809900h 6.28125e+01 


| 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


13-124 


Load Integer LDI 


Syntax LDI src, dst 
Operation src > dst 
Operands src general addressing modes (G): 


00 any CPU register 

01 direct 

10 — indirect (disp = 0-255, IRO, IR1) 
11. immediate 


dst any CPU register 


Opcode 
31 24.23 1615 87 0 
eedereoed o] a : 
Description The src operand is loaded into the dst register. The dst and src operands are 
assumed to be signed integers. An alternate form of LDI, LDP, is used to load 
the data-page pointer register (DP). See the LDP instruction in Section 13.6.2 
Optional Assembler Syntax beginning on page 13-34. 
Cycles 1 
Status Bits These condition flags are modified only if the destination register is R7-RO. 
LUF Unaffected 
LV Unaffected 
UF 0 
N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 
Vv 0 
Cc Unaffected 
Mode Bit OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-125 


LDI = Load Integer 


Example 


13-126 


LDI *-AR1(IRO),R5 


Before Instruction 


R5 00 0000 03C5] 965 
ARI 


Data memory 


27h 38 


After Instruction 


R5 
AR1 


38 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Load Integer Conditionally LDIcond 


LDicond src, dst 


If cond is true: 
src — dst, 


Else: 
dst is unchanged. 


src general addressing modes (G): 


00 any CPU register 

01. direct 

10 indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst any CPU register 


31 2423 1615 87 0 


If the condition is true, the src operand is loaded into the dst register. Other- 
wise, the dst register is unchanged. Regardless of the condition, a read of the 
src takes place. The dst and src operands are assumed to be signed integers. 


The ’C8x provides 20 condition codes that can be used with this instruction 
(see Table 13-12 on page 13-30 for a list of condition mnemonics, condition 
codes, and flags). Note that an LDIU (load integer unconditionally) instruction 
is useful for loading R7—RO without affecting the condition flags. Condition 
flags are set on a previous instruction only when the destination register is one 
of the extended-precision registers (R7—RO) or when one of the compare in- 
structions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) is executed. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-127 


LDicond Load Integer Conditionally 


Example LDIZ *ARO++,R6 

Before Instruction After Instruction 

R6 [__00 0000 OFEa] 4,066 R6 [_00 0000 oFEQ| 4,066 
ARO ARO 
LuF[__ LuFL__ 
vl wL 
UPL uFL__ 
nC nC 
zd zLD 
v~E vL_ 
cL cL 


Data memory 


8098F0h 636 8098F0h 636 


Note: Auxiliary Register Arithmetic 


The test condition does not affect the auxiliary register arithmetic. (AR 
modification always occurs.) 


13-128 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Load Integer, Interlocked 


LDII src, dst 


Signal interlocked operation 
src > dst 


src general addressing modes (G): 


01 direct 
10 _ indirect (disp = 0-255, IRO, IR1) 


dst any CPU register 


31 1615 87 


LDII 


2423 0 


The src operand is loaded into the dst register. An interlocked operation is sig- 
naled over XFO and XF1. The src and dst operands are assumed to be signed 
integers. Note that only the direct and indirect modes are allowed. See Section 


7.4, Interlocked Operations, on page 7-13 for a detailed description. 


1 if XF = 0 (see Section 7.4 on page 7-13) 


These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 

Vv 0 

Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 


13-129 


LDII = Load Integer, Interlocked 


Example LDII @985Fh,R3 


Before Instruction 
R3 
DP 
LuF[ 
vl 
uFL___ 
N[ 
zL_ 
v[E 
cL 
Data memory 


g0985Fh 


13-130 


After Instruction 

R3 
DP 
LuF[ 
wld 
UFL__ 
NC 
zL_ 

vL_ 
cL 


80985Fh 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Parallel LDI andLD/ LDI||LDI 


LDI src2, dst2 
|| LDI src7, dst? 


src2 > dst2 
|| src1 — dst1 


src? indirect 
dst? register 
src2__ indirect 
dst2 register 


— 


disp = 0, 1, IRO, IR1) 
Rni,0<n1 <7) 
disp = 0, 1, IRO, IR1) 
Rn2, 0 < n2<7) 


~~ O N 


This instruction’s operands have been augmented on the following devices: 


J ’C31 silicon revision 6.0 or greater 
J ’C2 silicon revision 2.0 or greater 
L) *VC33 


src? indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn1i,0<n1 <7) 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dsi2—register-{Rn2,0-<n2<F} 


31 2423 1615 87 0 
0 


Two integer loads are performed in parallel. The assembler issues a warning 
if the LDIs load the same register. The result is that of LDI src2, dst2. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-131 


LDI||LDI = Parallel! LDI and LDI 


Example LDI *-AR1(1),R7 
| | LDI *AR7++(IRO),R1 


Before Instruction After Instruction 

Rt Rt 750 

R7 R7 250 
ARI ARt 
AR7 AR7 
IRO IRO 
wwFL_ lwWF[_ 
vod wold 
UF[__ UF[ 
NC N{_ 
zl zl 
v[E v[_ 
cL cL 


Data memory 


809825h 250 809825h 250 
8098C8h 750 8098C8h 750 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


13-132 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Parallel LDI and ST! LDI||STI 


LDI src2, dst1 
|| STI src3, dsi2 


src2 — dst1 
l| src3—> dst2 


src2__ indirect 
dst? register 
src3_ register 
dst2 indirect 


disp = 0, 1, IRO, IR1) 
Ri, 0<n1 <7) 
Rn2, 0 < n2<7) 
disp = 0, 1, IRO, IR1) 


=~ ae 


This instruction’s operands have been augmented on the following devices: 


_j ’C31 silicon revision 6.0 or greater 
J ’C32 silicon revision 2.0 or greater 
LV ’VC33 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn1,0< ni <7) 


src3_ register (Rn2,0 < n2 <7) 


dsi2—indireet disp 0+ 4REOFRY 


3 2423 1615 87 0 


1 
0110 ‘| ast |000 src3 dsi2 src2 


An integer load and an integer store are performed in parallel. If src2 and dsi2 
point to the same location, src2 is read before the dsi2 is written. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-133 


LDI||STI = Parallel LDI and STI 


Example LDI *-AR1(1),R2 


| | STI R7,*AR5++(IRO) 


Before Instruction 


R2 
R7 53 
ARI 
ARS 


After Instruction 


R2 220 
R7 53 
AR1 
AR5 


Rofl RoL__ 
wFL_ luFL_ 
vL_ wl 
uF[_ UFL__ 
NCL NL 
zt zl 
vL__ vL__ 
cL cL 


Data memory 
8098E6h 220 
gosszch[ 


8098E6h 220 
goss2ch[___ 35] 53 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


13-134 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


Load Floating-Point Mantissa LDM 


LDM src, dst 
src (man) —> dst (man) 
src general addressing modes (G): 


00 register (Rn, O<n<7) 

01 direct 

10 __ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 <n<7) 


31 1615 87 0 


2423 


The mantissa field of the src operand is loaded into the mantissa field of the 
dst register. The dst exponent field is not modified. The src and dst operands 
are assumed to be floating-point numbers. If the src operand is from memory, 
the entire memory contents are loaded as the mantissa. If immediate address- 
ing mode is used, bits 15-12 of the instruction word are forced to 0 by the as- 
sembler. 


1 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 
LDM 156.75,R2 (156.75 = 071CC00000h) 


Before Instruction After Instruction 

R2 R2 1.22460938e+00 
luFL wuF[ 
vEsid wl 
UPL UF[__ 
NL NL 
zl zL__ 
VEL VL 
cL cL 


Assembly Language Instructions 13-135 


LDP Load Data-Page Pointer 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


13-136 


LDP src, DP 
src — data-page pointer 


src is the 8 MSBs of the absolute 24-bit source address (src). 
The “DP” in the operand is optional. 


31 2423 1615 87 0 


This pseudo-op is an alternate form of the LDUI instruction, except that LDP 
is always in the immediate addressing mode. The src operand field contains 
the eight MSBs of the absolute 24-bit src address (essentially, only 
bits 23-16 of src are used). These eight bits are loaded into the eight LSBs 
of the data-page pointer. 


The eight LSBs of the pointer are used in direct addressing as a pointer to the 
page of data being addressed. There is a total of 256 pages, each page 64K 
words long. Bits 31-8 of the pointer are reserved and should be kept set to 0. 


{ 


LUF Unaffected 
LV Unaffected 


UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 


LDP @809900h, DP 
or 
LDP @809900h 


Before Instruction After Instruction 

DP DP 
luFL_ wuF[ 
2 | wl 
uF[ UF[__ 
NL NE 
zL_ zl 
v>E VE 
cL cL 


Syntax 


Operation 
Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


Divide Clock by 16 LOPOWER 


LOPOWER (supported by: ’LC31 and ’C32, ’C31 silicon 
revision 5.0 or greater, ’C30 silicon revision 7.0 
or greater) 


H1 > H1/16 


None 


31 23 0 
joo 10 000 1/0000000 000000000 000000 1 


The device continues to execute instructions, but at the reduced rate of the 
CLKIN frequency divided by 16 (that is, in LOPOWER mode, a ’C3x device that 
supports this mode with a CLKIN frequency of 32 MHz performs in the same 
way as a 2-MHz ’C8x device, which has an instruction-cycle time of 1000 ns). 
This allows for low-power operation. 


The ’C3x CPUs slow down during the read phase of the LOPOWER instruc- 
tion. To exit the LOPOWER power-down mode, invoke the MAXSPEED 
instruction (opcode = 1080 0000 h). The ’C3x resumes full-speed operation 
during the read phase of the MAXSPEED instruction. 


Do not run the IDLE2 instruction in the LOPOWER mode. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


LOPOWER ; The processor slows down operation to 
; 1/16th of the H1 clock. 


Assembly Language Instructions 13-137 


LSH Logical Shift 


Syntax 


Operation 


Operands 


Opcode 


Description 


13-138 


LSH count, dst 


lf count = 0: 
dst << count > dst 


Else: 
dst >> |count | > dst 


count general addressing modes (G): 


00 any CPU register 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst any CPU register 


31 2423 1615 87 0 


The seven LSBs of the count operand are used to generate the 2s-comple- 
ment shift count. If the count operand is greater than 0, the dst operand is left 
shifted by the value of the count operand. Low-order bits shifted in are zero 
filled, and high-order bits are shifted out through the carry (C) bit. 


Logical left shift: 
Ceadstc0 


If the count operand is less than 0, the dst is right shifted by the absolute value 
of the count operand. The high-order bits of the dst operand are zero filled as 
they are shifted to the right. Low-order bits are shifted out through the C bit. 


Logical right shift: 
0-> dst>C 


If the count operand is 0, no shift is performed, and the C bit is set to 0. The 
count operand is assumed to be a signed integer, and the dst operand is as- 
sumed to be an unsigned integer. 


Cycles 


Status Bits 


Mode Bit 


Example 1 


Example 2 


1 


Logical Shift LSH 


These condition flags are modified only if the destination register is R7—R0. 


LUF Unaffected 
LV Unaffected 
F 0 
MSB of the output 


0 


U 
N 
Z 1 if a O output is generated; 0 otherwise 
Vv 
Cc 


Set to the value of the last bit shifted out; 0 for a shift count of 0 


OVM Operation is not affected by OVM bit value. 


LSH R4,R7 


Before Instruction 


R4 24 
R7 
uuF[ 


LSH *-AR5(IR1),R5 


Before Instruction 


R5 00 12C0 0000 
AR5 80 9908 


Data memory 


809904h 19 


After Instruction 


R4 00 0000 0018} 24 
R7 00 ACOO 0000 


After Instruction 


R5 00 0001 2C00 
AR5 80 9908 


809904h =i2 


Assembly Language Instructions 13-139 


LSH3 Logical Shift, 3-Operand 


Syntax 


Operation 


Operands 


Opcode 


Description 


13-140 


LSH3 count, src, dst 


lf count = 0: 
src << count > dst 


Else: 
src >> |count | — dst 


src 3-operand addressing modes (T): 


00 any CPU register 
01 _ indirect (disp = 0, 1, IRO, IR1) 
10 any CPU register 
11. indirect (disp = 0, 1, IRO, IR1) 


count 3-operand addressing modes (T): 


00 any CPU register 
01 any CPU register 
10 _ indirect (disp = 0, 1, IRO, IR1) 
11 indirect (disp = 0, 1, IRO, IR1) 


dst register (Rn, 0 < n< 27) 


31 2423 1615 87 0 


The seven LSBs of the count operand are used to generate the 2s-comple- 
ment shift count. 


If the count operand is greater than 0, a copy of the src operand is left shifted 
by the value of the count operand, and the result is written to the dst. (The src 
is not changed.) Low-order bits shifted in are zero filled, and high-order bits are 
shifted out through the carry (C) bit. 


Logical left shift: 
Ce src<e0 


If the count operand is less than 0, the src operand is right shifted by the abso- 
lute value of the count operand. The high-order bits of the dst operand are 0- 
filled as they are shifted to the right. Low-order bits are shifted out through the 
C bit. 


Logical right shift: 
0-> srce>C 


If the count operand is 0, no shift is performed, and the C bit is set to 0. The 
count operand is assumed to be a signed integer. The src and dst operands 
are assumed to be unsigned integers. 


Logical Shift, 3-Operand LSH3 


Cycles 1 


Status Bits These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 
N MSB of the output 
Z 1 if a O output is generated; 0 otherwise 
V 0 
Cc Set to the value of the last bit shifted out; 0 for a shift count of 0; 
unaffected if dstis not R7-RO 
Mode Bit OVM Operation is not affected by OVM bit value. 
Example 1 LSH3 R4,R7,R2 
Before Instruction After Instruction 
R2 R2 
R4 24 R4 24 
R7 R7 
LuF[L___ LuF[__ 
wl wld 
ue uD 
nC N 
z[ zL_ 
v[~E V 
cL] cL 
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LSH3 = Logical Shift, 3-Operand 


Example 2 LSH3 *-AR4(IR1),R5,R3 

Before Instruction After Instruction 

R3 R3 

R5 R5 

AR4 AR4 

i | mit 
wwF[_ wrFL__ 

iv [9 A )| 

UF — == UF[__ 
NC NL 

zL_ zl 
v~_ vL 
cL c[L 

Data memory 
809904h ais 809904h = 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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Syntax 


Operation 


Operands 


Opcode 


Description 


Parallel LSH3 and STI LSH3||STI 


LSH3_ count, src2, dst1 
|| STI src3, dst2 


If count = 0: 

src2 << count > dst1 
Else: 

src2 >> |count | > dst1 
|| src3 > dst2 


count register (Rn1, 0< n1 <7) 
src? indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn8, 0 < n3 <7) 
src2__ register (Rn4, 0 < n4<7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


~~ ~~ 


This instruction’s operands have been augmented in the following devices: 


_j ’C31 silicon revision 6.0 or greater 
_J ’C32 silicon revision 2.0 or greater 
 ’VC33 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn1,0<n1 <7) 

src3_ register (Rn2, 0 < n2 <7) 

dsi2—indireet teisp=0,+ 1ROARYH 


3 2423 1615 87 0 


{ 
0111 0} ast | coun src3 dsi2 src2 


The seven LSBs of the count operand are used to generate the 2s-comple- 
ment shift count. 


If the count operand is greater than 0, a copy of the src2 operand is left shifted 
by the value of the count operand, and the result is written to the dst7. (The 
src2 is not changed.) Low-order bits shifted in are zero filled, and high-order 
bits are shifted out through the carry (C) bit. 


Logical left shift: 
Ce src2<¢0 


If the count operand is less than 0, the src2 operand is right shifted by the abso- 
lute value of the count operand. The high-order bits of the dst operand are 0- 
filled as they are shifted to the right. Low-order bits are shifted out through the 
C bit. 
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LSH3||STI = Parallel LSH3 and STI 


Logical right shift: 


0-> src2>C 
If the count operand is 0, no shift is performed, and the carry bit is set to 0. 


The count operand is assumed to be a 7-bit signed integer, and the src2 and 
dst? operands are assumed to be unsigned integers. All registers are read at 
the beginning and loaded at the end of the execute cycle. This means that 
if one of the parallel operations (STI) reads from a register and the operation 
being performed in parallel (LSH3) writes to the same register, STI accepts as 
input the contents of the register before it is modified by the LSH3. 


If src2 and dst2 point to the same location, src2 is read before dst2 is written. 
Cycles 1 
Status Bits These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 

N MSB of the output 

Z 1 if a O output is generated; 0 otherwise 

Vv 0 

Cc Set to the value of the last bit shifted out; 0 for a shift count of 0 
Mode Bit OVM Operation is affected by OVM bit value. 
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Parallel LSH3 and ST|_ LSH3||STI 


Example 1 LSH3 R2,*++AR3 (1) ,RO 
|| STI R4,*-AR5 
Before Instruction After Instruction 
RO RO 
R2 24 R2 24 
R4 220 R4 220 
ARS ARS 
ARS ARS 
LuF[__ LuF[_ 
wv oa a 
UPL UF[ 
N[ NC 
zL_ zL_ 
v~> v~E 
cL cL 
Data memory 
8098C3h OAC 8098C3h 
sogsA2zh [=i 8098A2h 220 
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LSH3||STI_ Parallel LSH3 and STI 


Example 2 LSH3 R7, *AR2-- (1) ,R2 
| | STI RO, *+ARO (1) 
Before Instruction After Instruction 
RO 300 RO 300 
R2 R2 
R7 =12 R7 “12 
ARO ARO 
AR2 AR2 
LuF[__ LuF[ 
w J vod 
UF[___ UF[ 
NL] N[ 
zL_ zL_ 
vL vL 
cL cL 
Data memory 
809863h 809863h 
Lo 


8098B8h 


8098B8h 300 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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Syntax 


Operation 
Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


Restore Clock to Regular Speed MAXSPEED 


MAXSPEED (supported by ’C31, C32, ’C31 silicon revision 5.0 
or greater, ’C30 silicon revision 7.0 or greater) 
H1/16 > H1 
None 
31 23 1615 87 0 


jo00 10000 170000000000 0000000 000000 


Exits LOPOWER power-down mode (invoked by LOPOWER instruction with 
opcode 10800001h). The ’LC31 or ’C32 resumes full-speed operation during 
the read phase of the MAXSPEED instruction. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


MAXSPEED ; The processor resumes full-speed operation. 
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MPYF Multiply Floating-Point Value 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 
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MPYF src, dst 
dst x src > dst 
src general addressing modes (G): 


00 register (Rn, O< n<7) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, O<n<7) 


31 2423 1615 87 0 


The product of the dst and src operands is loaded into the dst register. The src 
operand is assumed to be a single-precision floating-point number, and the dst 
operand is an extended-precision floating-point number. 


{ 


These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if a floating-point overflow occurs; 0 otherwise 


Cc Unaffected 
OVM = Operation is not affected by OVM bit value. 
MPYF RO,R2 


Before Instruction After Instruction 


RO 07 0C80 0000} 1.4050e+02 RO | 07 0C80 0000] 1.4050e+02 
R2 03 4C20 0000} 1.27578125e+01 R2|_0A 600F 2000] 1.79247266e+03 


lurL_ LurL__ 
ve sid wl 
UL uF[ dl 
NL NC 
zL z[— 4 
vLE_ vL_ dd 
cL cL 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Multiply Floating-Point Value, 3-Operand MPYF3 


MPYFS3 src2, src1, dst 
src? x src2 > dst 
src? 3-operand addressing modes (T): 


00 register 
01 indirect 
10 — register 

indirect 


Rint, 0< nt <7) 
disp = 0, 1, IRO, IR1) 
Rn, 0< ni <7) 

disp = 0, 1, IRO, IR1) 


~~~ 


src2 3-operand addressing modes (T): 


00 register (Rn2, 0 < n2 <7) 

01 register (Rn2, 0 < n2 <7) 

10 _ indirect (disp = 0, 1, IRO, IR1) 
( 


indirect (disp = 0, 1, IRO, IR1) 


dst register (Rn,O<n<7) 


31 2423 1615 87 0 


The product of the src7 and src2 operands is loaded into the dst register. The 
src? and src2 operands are assumed to be single-precision floating-point 
numbers, and the dst operand is an extended-precision floating-point number. 


1 
These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 


LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if a floating-point overflow occurs; 0 otherwise 

Cc Unaffected 


OVM Operation is not affected by OVM bit value. 
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MPYF3 Multiply Floating-Point Value, 3-Operand 


Example 1 MPYF3 RO,R7,R1 
Before Instruction After Instruction 
RO 6.281250e+01 RO 6.281250e+01 
R1 R1 1.12905469e+04 
R7 1.79750e+02 R7 1.79750e+02 
wwF[ LF [d 
v{ i 
UF[ UP [[_ 
NC NC 
zl zL 
v~E vbL_ 
cL cL 
Example 2 MPYF3 *+AR2(IRO),R7,R2 
or 
MPYF3 R7,*+AR2(IRO) ,R2 
Before Instruction After Instruction 
R2 R2 8.82515625e+03 
R7 6.281250e+01 R7 6.281250e+01 
AR2 AR2 
IRO 120 IRO 
LUuF[_ LuFL_ 
i 2 ne) 
UPL UL 
NCL N[ 
eT zl 
v~E_ vE_ 
c[L cL 


Data memory 


80992Ah 70C8000] 1.4050e+02  80992Ah 1.4050e+02 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 
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Parallel MPYF3 and ADDF3 MPYF3||ADDF3 


Syntax MPYF3 srcA, srcB, dst1 
|| ADDF3 srcC, srcD, dst2 


Operation srcA x srcB > dst1 
|| srcC + srcD > dst2 


Operands srcA eae : 
srcB Any two indirect (disp = 0, 1 IRO, IR1) 


srcC Any two register (0 < Rn < 7) 
srcD 


dst? register (a1): 
0 =RO 
1=Rt1 


dst2 register (a2): 


0 =R2 
1=R3 
srct register (Rn, O< n<7) 
src2 register (Rn ie <n<7) 
src3 indirect (disp = 0, 1, IRO, IR1) 
src4 indirect ae 0-255, IRO, IR1) 
P parallel addressing modes (0 < P <3) 
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MPYF3||ADDF3 = Parallel MPYF3 and ADDF3 


This instruction’s operands have been augmented in the following 
devices: 


_j ’C31 silicon version 6.0 or greater 
_j ’C2 silicon version 2.0 or greater 


srcA, srcB, srcC, srcD can be one of the following combinations: 


Register Indirect 
(0<Rn< 7) (disp =0,1,1R0,1R1) | Any CPU Register 
2 2 - 
2 1 1 
2 - 2 


dst? register (d7): 


0 = RO 
1=RI1 
dst2 register (a2): 
0 =R2 
1=R3 
src1 register (Rn, 0 <n<7) 
src2 register (Rn, 0< n<7) 
src3 indirect (disp = 0, 1, IRO, IR1) or any CPU register 


( 
src4 indirect (disp = 0, 1, IRO, IR1) or any CPU register 


P parallel addressing modes (0 < P <3) 


Version 4.7 or earlier of TMS320 floating-point code-generation tools 


P srcA_ srcB srcD_ srcC 
00 src4x src3, src1+ src2 
01 src3 x src1, src4+ src2 
10 src1 x src2, src3+ src4 
11 src3 x src1, src2+ src4 


Version 5.0 or later 


P srcA srcB srcD- srcC 
00 src3 x src4, src1+ src2 
01 sre3 x src1, src4+ src2 
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Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Parallel MPYF3 and ADDF3 MPYF3||ADDF3 


10 src1 x src2, src3+ src4 
11 src3 x src1, src2+ src4 


31 24 23 1615 8 7 0 


A floating-point multiplication and a floating-point addition are performed in 
parallel. All registers are read at the beginning and loaded at the end of the 
execute cycle. If one of the parallel operations (MPYF3) reads from a register 
and the operation being performed in parallel (ADDF3) writes to the same reg- 
ister, then MPYF3 accepts the contents of the register as input before it is mo- 
dified by the ADDF3. 


Any combination of addressing modes can be coded for the four possible 
source operands as long as two are coded as indirect and two are coded as 
register. The assignment of the source operands srcA-—srcD to the 
src? — src4 fields varies, depending on the combination of addressing modes 
used, and the P field is encoded accordingly. 


If src2 and dst2 point to the same location, src2 is read before the write to d'st2. 
1 (see Note: Cycle Count) 
These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 


LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 0 

Z 0 

V 1 if a floating-point overflow occurs; 0 otherwise 

Cc Unaffected 


OVM Operation is not affected by OVM bit value. 
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MPYF3||ADDF3 = Parallel MPYF3 and ADDF3 


Example MPYF3 *AR5++(1),*--AR1(IRO) ,RO 
| | ADDF3 R5,R7,R3 


Note: Cycle Count 
One cycle if: 


J src3 and src4 are in internal memory 
(J src3is in internal memory and src4 is in external memory 


Two cycles if: 


J src3is in external memory and src4 is in internal memory 
J src3 and src4 are in external memory 


For more information see Section 8.5, Clocking Memory Accesses, on page 


8-24. 
Before Instruction After Instruction 
RO RO 2.88867188e+01 

R3 6.281250e+01 R3 3.20250e+02 

R5 1.79750e+02 R5 1.79750e+02 

R7 1.4050e+02 R7 1.4050e+02 
ARI ARI 
AR5 AR5 
Ro Lal iho La] 
LuF[__ LuF[__ 
er wll 
UPL UF[ 
N[ NL 
zL_ | eT) 
v~L_ vE_ 
cL cL 

Data memory 
8098C5h 1.2750e+01 g098C5h 1.2750e+01 


8098A4h 1110000] 2.265625e+00  8098A4h 2.265625e+00 
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Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Parallel MPYF3 and STF_ MPYF3\||STF 


MPYF3  src2, src1, dst 
|| STF src3, dst2 


src? x src2 > dst1 
l| src3— dst2 


src1_ register (Rn1,0<n1 <7) 
Src2__ indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn3, 0 < n8 <7) 
src3_ register (Rn4, 0 < n4<7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


This instruction’s operands have been augmented in the following devices: 


J ’C31 silicon revision 6.0 or greater 
J ’C32 silicon revision 2.0 or greater 
L) ’VC33 


src? register (Rn1,0<1n1 <7) 

src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn2, 0 < n2 <7) 

src3_ register (Rn3, 0 < n38 <7) 

dst2—indirecitelise —0 1 IROABA 


31 24 23 1615 8 7 0 
Ord tt A src3 dst2 src2 


A floating-point multiplication and a floating-point store are performed in paral- 
lel. All registers are read at the beginning and loaded at the end of the execute 
cycle. If one of the parallel operations (MPYF3) writes to a register and the op- 
eration being performed in parallel (STF) reads from the same register, then 
STF accepts the contents of the register as input before it is modified by the 
MPYF3. 


If src2 and dst2 point to the same location, src2 is read before the write to dsi2. 


1 
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MPYF3||STF Parallel! MPYF3 and STF 


Status Bits 


Mode Bit 


Example 


13-156 


These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; 0 unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if a floating-point overflow occurs; 0 otherwise 


Cc Unaffected 
OVM Operation is not affected by OVM bit value. 


MPYF3 *-AR2(1),R7,RO 
| | STF R3,*ARO-- (IRO) 
Before Instruction After Instruction 

RO RO 8.82515625e+03 
R3 4.7031250e+02 R3 4.7031250e+02 
R7 [05 7840 0000]6.281250e+01 R7 6.281250e+01 
ARO ARO 
AR2 AR2 

IRO IRo 
LF [| wrt 
wf] wl 

uF Lo] UFL___ 

NL N[ 

zL__ zL__ 

——— vL_ 

c[ | cL 


Data memory 


g0982Ah [_70c8000]1.4050e+02 80982Ah 1.4050e+02 


809860h 


809860h 4.7031250e+02 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


Parallel MPYF3 and SUBF3. MPYF3||SUBF3 


Syntax MPYF3_srcA, srcB, dst? 
|| SUBF3 srcC, srcD, dst2 


Operation srcA x srcB > dst1 
|| srcD—srcC > dst2 


srcA 
Operands srcB | Any two indirect (disp = 0, 1, IRO, IR1) 


srcC Any two register (0 < Rn < 7) 
srcD 


dst? register (d7): 


0 =RO 
1=RI1 
dst2 register (a2): 
0=R2 
1=R3 
srct register (Rn, O<n<7) 
src2 register (Rn, 0< n< 7) 
src3 indirect (disp = 0, 1, IRO, IR1) 
src4 indirect (disp = 0, 1, IRO, IR1) 
P parallel addressing modes (0 < P <3) 
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MPYF3||ADDF3 = Parallel MPYF3 and ADDF3 


This instruction’s operands have been augmented in the following 
devices: 


_j ’C31 silicon version 6.0 or greater 
J ’C82 silicon version 2.0 or greater 


srcA, srcB, srcC, srcD can be one of the following combinations: 


Register Indirect 
(0<Rn< 7) (disp =0,1,1R0,1R1) Any CPU Register 
2 2 - 
2 1 1 
2 - 2 


dst? register (d7): 


0 = RO 
1=R1 
dst2 register (a2): 
0 =R2 
1=R3 
src1 register 7) 


(Rn, O<sns 
src2 register (Rn, 0< n<7) 


src3 indirect (disp = 0, 1, IRO, IR1) or any CPU register 
src4 indirect (disp = 0, 1, IRO, IR1) or any CPU register 


P parallel addressing modes (0 < P <3) 


Version 4.7 or earlier of TMS320 floating-point code-generation tools 


P srcA srcB srcD srcC 

00 src4 x src3, src1 — src2 
01 srce3 x src1, src4 — src2 
10 sre? x src2, src3 — src4 
11 srce3 x src1, src2 — src4 


Version 5.0 or later 


P srcA_ srcB srcD srcC 

00 src3 x src4, src1 — src2 
01 src3 x src1, src4 — src2 
10 src? x src2, src3 — src4 
11 src3 x src1, src2 — src4 
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Opcode 


Description 


Cycles 


Parallel MPYF3 and SUBF3. MPYF3||SUBF3 


31 24 23 1615 8 7 0 
000 Ee srci src2 src3 src4 


A floating-point multiplication and a floating-point subtraction are performed 
in parallel. All registers are read at the beginning and loaded at the end of the 
execute cycle. If one of the parallel operations (MPYF3) reads from a register 
and the operation being performed in parallel (SUBF3) writes to the same reg- 
ister, MPYF3 accepts as input the contents of the register before it is modified 
by the SUBF3. 


Any combination of addressing modes can be coded for the four possible 
source operands as long as two are coded as indirect and two are coded regis- 
ter. The assignment of the source operands srcA — srcD to the src1 — src4 
fields varies, depending on the combination of addressing modes used, and 
the P field is encoded accordingly. 


1 


——___ _.-..._—_—oasaaaSQSoaSQSoQS—QaQa—QaoQaoeoe0@ 


Note: Cycle Count 
One cycle if: 


(J src3and src4 are in internal memory 
(J src3is in internal memory and src4 is in external memory 


Two cycles if: 


(J src3is in external memory and src4 is in internal memory 
J src3 and src4 are in external memory 


For more information see Section 8.5, Clocking Memory Accesses, on page 
8-24. 


| | 
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MPYF3||SUBF3 Parallel MPYF3 and SUBF3 


Status Bits These condition flags are modified only if the destination register is R7—RO. 
LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 
N 0 
Z 0 
Vv 1 if a floating-point overflow occurs; 0 otherwise 
Cc Unaffected 
Mode Bit OVM = Operation is not affected by OVM bit value. 
Example MPYF3 R5,*++AR7(IR1) ,RO 
| | SUBF3 R7, *AR3-- (1) ,R2 
or 
MPYF3 *++AR7(IR1), R5,RO0 
| | SUBF3 R7, *AR3-- (1) ,R2 
Before Instruction After Instruction 
RO RO 2.88867188e+01 
R2 R2 -3.9250e+01 
R5 1.2750e+01 R5 1.2750e+01 
R7 [07 33C0 0000|1.79750e+02 R7 1.79750e+02 
AR3 ARS 
AR7 AR7 
Int IRI 
LUuFL__ LuFL__ 
wl er 
UF[___ UF[_ 
NL NL 
Z| zl 
vL_ v~L__ 
cL cL 


Data memory 


80990Ch 2.250e+00 80990Ch 2.250e+00 
8098B2h 1.4050e+02 8098B2h 1.4050e+02 
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Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Multiply Integer MPYI 


MPYI src, dst 
dst x src > dst 
src general addressing modes (G): 
00 any CPU register 
01 direct 
10 — indirect (disp = 0-255, IRO, IR1) 
11 


immediate 


dst any CPU register 


31 2423 1615 Cis 0 


The product of the dst and src operands is loaded into the dsf register. The src 
and dst operands, when read, are assumed to be 24-bit signed integers. The 
result is assumed to be a 48-bit signed integer. The output to the dst register 
is the 32 LSBs of the result. 


Integer overflow occurs when any of the 16 MSBs of the 48-bit result differs 
from the MSB of the 32-bit output value. 


{ 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc Unaffected 


OVM Operation is affected by OVM bit value. 


Assembly Language Instructions 13-161 


MPYI = Multiply Integer 


Example MPYI R1,R5 


Before Instruction After Instruction 


R1 3,392,081 R1 3,392,081 
R5 7,910,912 R5 -501,377,536 
luF[ i LUF 
vl LV 
uF[ UF 
Lo 
CL 
[J 


[ 
[Ld 
[ 
[ [ 


QO < N Z 
QO < N Z 


13-162 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Multiply Integer, 3-Operand MPYI3 


MPYI3 src2, src1, dst 
src? x src2 > dst 
src? 3-operand addressing modes (T): 
00 any CPU register 
01 indirect (disp = 0, 1, IRO, IR1) 
any CPU register 
indirect (disp = 0, 1, IRO, IR1) 
src2 3-operand addressing modes (T): 
00 any CPU register 
01 any CPU register 
1 indirect (disp = 0, 1, IRO, IR1) 
11 indirect (disp = 0, 1, IRO, IR1) 


dst register (Rn, 0 < n< 27) 


31 24 23 16 15 8 7 0 


The product of the src? and src2 operands is loaded into the dst register. The 
src? and src2 operands are assumed to be 24-bit signed integers. The result 
is assumed to be a signed 48-bit integer. The output to the dst register is the 
32 LSBs of the result. 


Integer overflow occurs when any of the 16 MSBs of the 48-bit result differs 
from the MSB of the 32-bit output value. 


{ 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc Unaffected 


OVM Operation is affected by OVM bit value. 


Assembly Language Instructions 13-163 


MPYI3 = Multiply Integer, 3-Operand 


Example 1 MPYI3 *AR4,*-AR1(1) ,R2 
Before Instruction After Instruction 
R2 R2 38, 060 
ARI ARI 
AR4 AR4 
LuFL_ LuF [__ 
v[ vw Lo 
uUFL_ UP [ 
nC] nC] 
zL_ z[ 
vL_ v Lf 
cL c Lo 
Data memory 
809850h 173 809850h 173 
8098F2h ODC} 220 8098F2h opc 220 
Example 2 MPYI3 *--AR4(IRO) ,R2,R7 
Before Instruction After Instruction 
R2 200 R2 200 
R7 R7 10,000 
AR4 AR4 
IRO IRO 
wwF[_ wr [ 
wd w [4] 
UPL UF [Od 
NL N Lol 
zL_ z[ 
vL_ v Lo 
cL c Lo 
Data memory 
8099F0h 50 8099F0h 50 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


13-164 


Syntax 


Operation 


Operands 


Parallel MPY1I3 and ADDI3 


MPYI3 _— srcA, srcB, dst1 
|| ADDI3 — srcC, srcD, dst2 


srcA x srcB > dst? 
|| srcD+ srcC > dsi2 


srcA 


srcB 
srcC 
srcD 


srcA 


dst? 


dst2 


Any two indirect (disp = 0, 1, IRO, IR1) 
Any two register (0 < Rn < 7) 


register (d7): 


0 =RO 

1=Rt1 

register (a2): 

0=R2 

1=R3 

register (Rn, O<n<7) 
register (Rn, 0< n<7) 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, IRO, IR1) 


parallel addressing modes (0 < P < 3) 


MPYI3||ADDI3 


, srcB, srcC, srcD can be one of the following combinations: 


Assembly Language Instructions 13-165 


MPYI3||ADDI3 = Parallel MPYI3 and ADDI3 


13-166 


This instruction’s operands have been augmented in the following 
devices: 


_j ’C31 silicon version 6.0 or greater 
J ’C82 silicon version 2.0 or greater 


srcA, srcB, srcC, srcD can be one of the following combinations: 


Register Indirect 
(0<Rn< 7) (disp = 0,1,1R0,IR1) Any CPU Register 
2 2 - 
2 1 1 
2 - 2 


dst? register (d7): 


0 = RO 

1=Rt1 
dst2 register (d2): 

0=R2 

1=R3 
src register (Rn, 0 <n<7) 
src2 register (Rn, 0< n<7) 


src3 indirect (disp = 0, 1, IRO, IR1) or any CPU register 
src4 indirect (disp = 0, 1, IRO, IR1) or any CPU register 


P parallel addressing modes (0 < P <3) 


Version 4.7 or earlier of TMS320 floating-point code-generation tools 


P srcA srcB srcD_ srcC 
00 src4 x src3, src1 + src2 
01 src3 x src1, src4 + src2 
10 src? x src2, src4 + src4 
11 src3 x src1, src2 + src4 


Version 5.0 or later 


P srcA  srcB srcD  srcC 
00 src3 x src4, src1 + src2 
01 src3 x src1, src4 + src2 
10 src? x src2, src3 + src4 
11 src3 x src1, src2 + src4 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


Parallel MPYI3 and ADDI3) ~MPY13||ADDI3 


31 2423 1615 87 0 
0010 lala src src2 src3 src4 


An integer multiplication and an integer addition are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (MPYI3) reads from a register 
and the operation being performed in parallel (ADDI3) writes to the same reg- 
ister, MPYI3 accepts the contents of the register as input before it is modified 
by the ADDIS. 


Any combination of addressing modes can be coded for the four possible 
source operands as long as two are coded as indirect and two are coded as 
register. The assignment of the source operands srcA-srcD to the 
src1 — src4 fields varies, depending on the combination of addressing modes 
used, and the P field is encoded accordingly. To simplify processing when the 
order is not significant, the assembler may change the order of operands in 
commutative operations. 


1 (see Note: Cycle Count on page 13-167) 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 0 

Z 0 

V 1 if an integer overflow occurs; 0 otherwise 
Cc Unaffected 


OVM Operation is affected by OVM bit value. 


MPYI3 R7,R4,R0 
| | ADDI3 *-AR3,*AR5--(1) ,R3 


Assembly Language Instructions 13-167 


MPYI3||ADDI3 = Parallel! MPYI3 and ADD13 


Cc 

Data memory 
80981Eh 
80996Eh 


Note: 


Before Instruction 
00 0000 0000 
00 0000 0000 
00 0000 0064 
00 0000 0014 

80 981F 


80 996E 


OFFFFFFCB 


PULLER 
POORER 


35 


Cycle Count 


One cycle if: 


Two cycles if: 


100 
20 


-53 
a3 


80981Eh 
80996Eh 


_jJ src3 and src4 are in internal memory 
_j src3is in internal memory and src4 is in external memory 


After Instruction 
00 0000 0O7DO 
00 0000 0000 
00 0000 0064 
00 0000 0014 

80 981F 


80 996D 


OFFFFFFCB 


35 


2000 


100 
20 


-53 


53 


_j src3is in external memory and src4 is in internal memory 
_j src3 and src4 are in external memory 


For more information see Section 8.5, Clocking Memory Accesses, on page 


8-24. 


13-168 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Parallel MPYI3 and ST! MPYI3||STI 


MPYI3 — src2, src1, dst1 
|| STI src3, dst2 


src? x src2 > dst1 
l| src3— dst2 


src1_ register (Rn1,0<n1 <7) 
Src2__ indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn3, 0 < n8 <7) 
src3_ register (Rn4, 0 < n4<7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


This instruction’s operands have been augmented in the following devices: 


J ’C31 silicon revision 6.0 or greater 
J ’C32 silicon revision 2.0 or greater 
L) ’VC33 


src? register (Rn1,0<n1 <7) 

src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn2, 0 < n2 <7) 

src3_ register (Rn3, 0 < n3 <7) 

dst2— indirect aisp—0, t+ 4ROARH 


31 2423 1615 8 7 0 
1000 of ast | sci | src3 dsi2 src2 


An integer multiplication and an integer store are performed in parallel. All reg- 
isters are read at the beginning and loaded at the end of the execute cycle. If 
one of the parallel operations (STI) reads from a register and the operation be- 
ing performed in parallel (MPYI3) writes to the same register, STI accepts as 
input the contents of the register before it is modified by the MPYI3. 


If src2 and dst2 point to the same location, src2 is read before the write to dst2. 


Integer overflow occurs when any of the 16 MSBs of the 48-bit result differ from 
the MSB of the 32-bit output value. 


1 


Assembly Language Instructions 13-169 


MPYI3||STI = Parallel MPYI3 and STI 


Status Bits These condition flags are modified only if the destination register is R7—RO. 
LUF Unaffected 
LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 
N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 
Vv 1 if an integer overflow occurs; 0 otherwise 
Cc Unaffected 
Mode Bit OVM Operation is affected by OVM bit value. 
Example MPYI3 *++AR0(1),R5,R7 
| | STI R2,*-AR3 (1) 
Before Instruction After Instruction 
R2 220 R2 220 
R5 50 R5 50 
R7 R7 10000 
ARO ARO 
AR3 ARS 
LuF[_ WF [| 
wv 7] vw fd] 
UP[___ 2 | 
NL N Lo 
Z[= z[L 
vL_ v Lo 
cL c Lo 
Data memory 
80995Bh 200 80995Bh ocs] 200 
go9s2Eh [| —~—~—SCi 80982Eh ope] 220 


Note: Cycle Count 


See Section 8.5.2, Data Loads and Stores, on page 8-24 for the effects of 
operand ordering on the cycle count. 


13-170 


Syntax 


Operation 


Operands 


Parallel MPYI3 and SUBI3 


MPYI3 srcA, srcB, dst1 
|| SUBI3 srcC, srcD, dst2 


srcA x srcB > dst? 
|| srcD—srcC > dst2 


srcA 


srcB 
srcC 
srcD 


srcA 


dst? 


dst2 


Any two indirect (disp = 0, 1, IRO, IR1) 
Any two register (0 < Rn < 7) 


register (d7): 


0 =RO 

1=Rt1 

register (a2): 

0=R2 

1=R3 

register (Rn, O<n<7) 
register (Rn, 0< n<7) 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, IRO, IR1) 


parallel addressing modes (0 < P < 3) 


MPYI3||SUBI3 


, srcB, srcC, srcD can be one of the following combinations: 


Assembly Language Instructions 13-171 


MPYI3||SUBI3 = Paralle! MPYI3 and SUBI3 


13-172 


This instruction’s operands have been augmented in the following 
devices: 


J ’C31 silicon version 6.0 or greater 
_j ’C82 silicon version 2.0 or greater 


srcA, srcB, srcC, srcD can be one of the following combinations: 


Register Indirect 
(0<Rn< 7) (disp = 0, 1, IRO, IR1) Any CPU Regisiter 
2 2 - 
2 1 1 
2 - 2 


dst? register (d7): 


0 =RO 
1=Rt1 
dst2 register (a2): 
0 =R2 
1=R3 
src1 register (Rn, 0 <n<7) 
src2 register (Rn, 0< n<7) 
src3 indirect (disp = 0, 1, IRO, IR1) or any CPU register 


( 
src4 indirect (disp = 0, 1, IRO, IR1) or any CPU register 


P parallel addressing modes (0 < P <3) 


Version 4.7 or earlier of TMS320 floating-point code-generation tools 


P srcA_ srcB srcD_ srcC 
00 src4 x src3, src1 + src2 
01 src3 x src1, src4 + src2 
10 src1 x src2, src4 + src4 
11 src3 x src1, src2 + src4 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


Parallel MPYI3 and SUBI3) ~MPYI3||SUBI3 


Version 5.0 or later 


P srcA  srcB srcD — srcC 
00 src3 x src4, src1 + src2 
01 src3 x srct, src4 + src2 
10 src? x src2, src3 + src4 
11 src3 x srct, src2 + src4 
31 24 23 1615 8 7 


An integer multiplication and an integer subtraction are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. If one of the parallel operations (MPYI3) reads from a register and the 
operation being performed in parallel (SUBI3) writes to the same register, 
MPYI3 accepts the contents of the register as input before it is modified by the 
SUBI3. 


Any combination of addressing modes can be coded for the four possible 
source operands as long as two are coded as indirect and two are coded as reg- 
ister. The assignment of the source operands srcA — srcD to the src1 — src4 
fields varies, depending on the combination of addressing modes used, and the 
P field is encoded accordingly. To simplify processing when the order is not sig- 
nificant, the assembler may change the order of operands in commutative op- 
erations. 


Integer overflow occurs when any of the 16 MSBs of the 48-bit result differs 
from the MSB of the 32-bit output value. 


1 (see Note: Cycle Count on page 13-173) 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 1 if an integer underflow occurs; 0 otherwise 

N 0 

Z 0 

V 1 if an integer overflow occurs; 0 otherwise 

Cc Unaffected 


OVM Operation is affected by OVM bit value. 


MPYI3 R2,*++AR0(1),RO 
| | SUBI3 *AR5--(IR1) ,R4,R2 


Assembly Language Instructions 13-173 


MPYI3||SUBI3 = Paralle! MPYI3 and SUBI3 


or 
MPYI3 *+4+AR0(1),R2,RO0 
| | SUBI3 *AR5--(IR1) ,R4,R2 
Before Instruction After Instruction 
RO RO 4900 
R2 50 R2 800 
Ra 2000 R4 2000 
ARO ARO 
AR5 AR5 
IR1 IR1 
LuF[__ LUF[ 
wf] wl] 
UF[ uUF[__ 
NL] NL] 
z[ z{_ 
vL vL_ 
cL] cL] 
Data memory 
8098E4h 98 8098E4h 98 
8099FCh 1200 8099FCh 4B0| 1200 
Note: Cycle Count 
One cycle if: 


Two cycles if: 


J src3 and src4 are in internal memory 
J src3is in internal memory and src4 is in external memory 


(J src3is in external memory and src4 is in internal memory 
J src3 and src4 are in external memory 


For more information see Section 8.5, Clocking Memory Accesses, on page 


8-24. 


13-174 


Negative Integer With Borrow NEGB 


Syntax NEGB src, dst 
Operation 0-src—-C—- ast 
Operands src general addressing modes (G): 


00 any CPU register 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst any CPU register 


Opcode 
31 24 23 16 15 87 0 
pveerer de] = : 
Description The difference of the 0, src, and C operands is loaded into the dst register. The 
dst and src are assumed to be signed integers. 
Cycles 1 
Status Bits These condition flags are modified only if the destination register is R7—RO. 
LUF Unaffected 
LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 
N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 
V 1 if an integer overflow occurs; 0 otherwise 
Cc 1 if a borrow occurs; 0 otherwise 
Mode Bit OVM Operation is affected by OVM bit value. 
Example NEGB R5,R7 
Before Instruction After Instruction 
RS -53 R5 -53 
R7 R7 52 
wuF L___ LuFL__ 
wf w[ 
UF [| UF[ 
NL] NC] 
_ ———er zt 
v~L vL__ 
Cc Cc 


Assembly Language Instructions 13-175 


NEGF _Negate Floating-Point Value 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-176 


NEGF src, dst 
0 - src > dst 
src general addressing modes (G): 


00 register (Rn, O<n<7) 

01 ~~ direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 < n< 7) 


31 24 23 1615 8 7 0 


The difference of the 0 and src operands is loaded into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 


1 
These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
1 if a floating-point overflow occurs; unchanged otherwise 
1 if a floating-point underflow occurs; 0 otherwise 

1 if a negative result is generated; 0 otherwise 

1 if a O result is generated; 0 otherwise 

1 if a floating-point overflow occurs; 0 otherwise 

U 


naffected 


OVM Operation is affected by OVM bit value. 


Example 


Cc 
Data memory 
809802h 


NEGF *++AR3(2),R1 


Before Instruction 


Negate Floating-Point Value NEGF 


After Instruction 


05 7B40 0025} 6.28125006e+01 R1 07 F380 0000| -1.4050e+02 


80 9800 


[Lo 
Lo 
— || 
Lo 
CL 
Lo 
Lo 


70C8000}| 1.4050e+02 


AR3 
LUF 
LV 
UF 


QO< N Z 


[ 
[ 
Lo 
[ 
Lt 
[ 
Lo] 


809802h 1.4050e+02 


Assembly Language Instructions 


13-177 


NEGF||STF 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


13-178 


Parallel NEGF and STF 


NEGF src2, dst? 
|| STF — src3, dst2 


0 - src2 = dst1 
|| src3— dsi2 


disp = 0, 1, IRO, IR1) 
Rn, 0< ni <7) 
Rn2, 0 < n2 <7) 
disp = 0, 1, IRO, IR1) 


src2__ indirect 
dst? register 
src3 register 
dst2 indirect 


~~ a a> 


This instruction’s operands have been augmented in the following devices: 


(J ’C31 silicon revision 6.0 or greater 
(J ’C32 silicon revision 2.0 or greater 
Lj ’VC33 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn1,0<1n1 <7) 


src3_ register (Rn2, 0 < n2 <7) 
dst2— indirect (disp = 0, + tROtRT) 


3 


‘ 24 23 1615 87 0 


A floating-point negation and a floating-point store are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STF) reads from a reg- 
ister and the operation being performed in parallel (NEGF) writes to the same 
register, STF accepts the contents of the register as input before it is modified 
by the NEGF. 


If src2 and dst2 point to the same location, src2 is read before the write to dst2. 
1 


These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if a floating-point overflow occurs; 0 otherwise 


Cc Unaffected 
OVM = Operation is not affected by OVM bit value. 


Parallel NEGF and STF_ NEGF||STF 


Example NEGF *AR4—— (1) ,R7 
| | STF R2,*++AR5 (1) 
Before Instruction After Instruction 
R2 1.79750e+02 R2 1.79750e+02 
R7 R7 [05 84co 0000] -6.281250e+01 
AR4 AR4 
ARS ARS 
wuF [ WWF [ 
v C9] w [9] 
ue [a ue [1] 
NL] N [To] 
zL___ zL_ ol 
v Lo] v Lol 
| cf 4] 
Data memory 
8098E1h 6.281250e+01 8098E1h 6.281250e+01 
gogso4h [CO 809804h 1.79750e+02 


Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 


Assembly Language Instructions 13-179 


NEGI  Negate Integer 


Syntax NEGI src, dst 
Operation 0 - src > ast 
Operands src general addressing modes (G): 


00 any CPU register 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst any CPU register 


Opcode 
31 24 23 16 15 87 0 
porno] ~ |= 
Description The difference of the 0 and src operands is loaded into the dst register. The 
dst and src operands are assumed to be signed integers. 
Cycles 1 
Status Bits These condition flags are modified only if the destination register is R7-RO. 
LUF Unaffected 
LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 
N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 
Vv 1 if an integer overflow occurs; 0 otherwise 
Cc 1 if a borrow occurs; 0 otherwise 
Mode Bit OVM Operation is affected by OVM bit value. 
Example NEGI 174,R5 (174 = OAEh) 
Before Instruction After Instruction 
RS 220 RS -174 
LuF [ wuF [ 
wv Lo] wv Lo] 
UF [oo UF [| 
N [ol N 
z Lo] z Li] 
v [| v Coo] 
c C9] C 


13-180 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Parallel NEGI and ST! NEGI||STI 


NEGI = src2, dst? 
|| STI src3, dst2 


0 - src2 > dst1 
| src3— dst2 


disp = 0, 1, IRO, IR1) 
Ri, 0<n1 <7) 
Rn2, 0 < n2<7) 
disp = 0, 1, IRO, IR1) 


src2__ indirect 
dst? register 
src3_ register 
dst2 indirect 


~~~ 


This instruction’s operands have been augmented in the following devices: 


_j ’C31 silicon revision 6.0 or greater 
J ’C32 silicon revision 2.0 or greater 
L) ’VC33 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rni, 0 <n1 <7) 
src3_ register (Rn2, 0 < n2 <7) 


isp=0, t+ tRORt) 


31 24 23 1615 87 0 


10010 asst [000 src3 dsi2 src2 


An integer negation and an integer store are performed in parallel. All registers 
are read at the beginning and loaded at the end of the execute cycle. If one 
of the parallel operations (STI) reads from a register and the operation being 
performed in parallel (NEGI) writes to the same register, STI accepts the con- 
tents of the register as input before it is modified by the NEGI. 


If src2 and dst2 point to the same location, src2 is read before the write to dst2. 
1 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a borrow occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 


Assembly Language Instructions 13-181 


NEGI||STI  Paralle! NEG/ and STI 


Example NEGI *-AR3,R2 
| | STI R2,*AR1++ 
Before Instruction After Instruction 
R2 25 R2 -220 
ARI ARI 
AR3 AR3 
| LurF [0] 
vw Lo iv [ol 
UF [ UF [_ 
N [9] N 
zC 9 zo 
v Lo] v Lo] 
| eT c 
Data memory 
80982Eh opc] 220 80982Eh 220 
sossAsh [i] 8098ASh 25 


Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 


13-182 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 1 


Example 2 


No Operation 


NOP src 


No ALU or multiplier operations. 
ARn is modified if src is specified in indirect mode. 


src general addressing modes (G): 
00 _~ register(no operation) 


10 indirect (modify ARn, 0 < n<7) 
(disp = 0-255, IRO, IR1) 


31 15 8 7 


24 23 16 
fooo|o11001/@ |oo000 src 


NOP 


0 


If the src operand is specified in the indirect mode, the specified addressing 
operation is performed, and a dummy memory read occurs. If the src operand 


is omitted, no operation is performed. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


NOP 
Before Instruction After Instruction 
PC PC 
NOP *AR3--(1) 
Before Instruction After Instruction 
AR3 AR3|___80_98FF| 
PC pceL__ 


Assembly Language Instructions 


13-183 


NORM _ Normalize 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-184 


NORM src, dst 
norm (src) — dst 
src general addressing modes (G): 


00 register (Rn, O<n<7) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


31 24 23 1615 87 0 


The src operand is assumed to be an unnormalized floating-point number; that 
is, the implied bit is set equal to the sign bit. The dst is set equal to the normal- 
ized src operand with the implied bit removed. The dst operand exponent is 
set to the src operand exponent minus the size of the left shift necessary to 
normalize the src. The dst operand is assumed to be a normalized floating- 
point number. 


If src (exp) = -128 and src (man) = 0, then dst = 0, Z= 1, and UF =0. 
If src (exp) = -128 and src (man) # 0, then dst= 0, Z = 0, and UF = 1. 


For all other cases of the src, if a floating-point underflow occurs, then 
dst (man) is forced to 0 and dst (exp) = -128. 


If src (man) = 0, then dst (man) = 0 and dst (exp) = -128. 


Refer to Section 5.7, Normalization Using the NORM Instruction, on page 5-37 
for more information. 


1 
These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV Unaffected 


UF 1 if a floating-point underflow occurs; 0 otherwise 
N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 0 

Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 


Normalize NORM 


Example NORM R1,R2 

Before Instruction After Instruction 
Rt Rt 
R2 R2 [_F2 6BD4 0000] 1.12451613e - 04 
wr [ uF [ 
vw Lo vw Lo 
uF [ uF [_ 
vn Cd nC 4 
z Cd zC gd 
ec er 
cL cL 


Assembly Language Instructions 13-185 


NOT _Bitwise-Logical Complement 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-186 


NOT src, dst 
~src > dst 
src general addressing modes (G): 


00 any CPU register 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst any CPU register 


31 2423 1615 8 7 0 


The bitwise-logical complement of the src operand is loaded into the dst regis- 
ter. The complement is formed by a logical NOT of each bit of the src operand. 
The dst and src operands are assumed to be unsigned integers. 


{ 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output 

Z 1 if a O result is generated; 0 otherwise 
Vv 0 

Cc Unaffected 


OVM Operation is affected by OVM bit value. 


Bitwise-Logical Complement NOT 


Example NOT @982Ch,R4 

Before Instruction After Instruction 

R4 R4 

DP 080 DP 080 

wr Ld wWF Ld 

we [al vw Lo 

uF [ol UF [ 

n [9] N 

z C4] zC 9 

v Lo v Lol 

c Lo i nn) 

Data memory 

80982Ch 80982Ch 


Assembly Language Instructions 13-187 


NOT||STI Parallel NOT and STI 


Syntax NOT _ src2, dst? 
|| STI src3, dst2 
Operation ~src2 > dst? 
|| src3 > dst2 
Operands src2__ indirect (disp = 0, 1, IRO, IR1) 


dst? register 
src3_ register 
dst2 indirect 


Rn, 0<n1 <7) 
Rn2, 0 < n2 <7) 
disp = 0, 1, IRO, IR1) 


—~ ~~~ 


This instruction’s operands have been augmented in the following devices: 


_j ’C31 silicon revision 6.0 or greater 
(J ’C32 silicon revision 2.0 or greater 
Lj ’VC33 


src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn1,0<1n1 <7) 

src3_ register (Rn2, 0 < n2 <7) 

dst2— indirect (disp = 6, +, tROART 


Opcode 
31 24 23 16 15 8 7 0 
Description A bitwise-logical NOT and an integer store are performed in parallel. All regis- 
ters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (NOT) writes to the same register, STI 
accepts the contents of the register as input before it is modified by the NOT. 
If src2 and dsi2 point to the same location, src2 is read before the write to dst2. 
Cycles 1 
Status Bits These condition flags are modified only if the destination register is R7-RO. 
LUF Unaffected 
LV Unaffected 
UF 0 
N MSB of the output 
Z 1 if a O result is generated; 0 otherwise 
Vv 0 
Cc Unaffected 
Mode Bit OVM = Operation is not affected by OVM bit value. 


13-188 


Parallel NOT and STI 


NOT||STI 


Example NOT *+AR2,R3 
| | STI R7,*--AR4 (IR1) 
Before Instruction After Instruction 
R3 R3 
R7 220 R7 220 
AR2 AR2 
AR4 AR4 
IR1 IR1 
LwuF Lo] WF [To] 
wv (a) w C9] 
uP (=i ue [9] 
N [Lo] N 
z _ a] z Lo] 
v [Oo] v [Lo] 
¢ Lo] c Lol 
Data memory 
8099CCh 8099CCh 
sogs40h [| ———— 809840h ope] 220 
Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 


Assembly Language Instructions 


13-189 


OR __Bitwise-Logical OR 


Syntax OR src, dst 
Operation dst OR src > dst 
Operands src general addressing modes (G): 


00 any CPU register 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate (not sign-extended) 


dst any CPU register 


Opcode 
31 24 23 1615 87 0 
eefrevced ew : 
Description The bitwise-logical OR between the src and dst operands is loaded into the dst 
register. The dst and src operands are assumed to be unsigned integers. 
Cycles 1 
Status Bits These condition flags are modified only if the destination register is R7—RO. 
LUF Unaffected 
LV Unaffected 
UF 0 
N MSB of the output 
Z 1 if a O result is generated; 0 otherwise 
Vv 0 
Cc Unaffected 
Mode Bit OVM = Operation is not affected by OVM bit value. 


13-190 


Bitwise-LogicalOR OR 


Example OR *++AR1(IR1) ,R2 

Before Instruction After Instruction 

R2 R2 

ARI ARI 

Ri Loa] Ri Loa] 

LuF [To] LuF [| 

vw Lo wv Lo] 

UF [_] UF [| 

N Lo] N [Lo 

z Lo z Lo] 

v Ld v Lo 

c Lo] c Lol 

Data memory 

809804h 809804h 


Assembly Language Instructions 13-191 


OR3 _ Bitwise-Logical OR, 3-Operand 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-192 


OR3 src2, src1, dst 
src1 OR src2 > dst 
src? 3-operand addressing modes (T): 


00 _ register 
01 indirect 
register 
indirect 


Rni,0< ni <27) 
disp = 0, 1, IRO, IR1) 
Rni, 0 <n < 27) 
disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (1): 


00 register (Rn2, 0 < n2 < 27) 
01 register (Rn2, 0 < n2 < 27) 
indirect (disp = 0, 1, IRO, IR1) 


—_~oa~ 


indirect (disp = 0, 1, IRO, IR1) 


dst register (Rn, 0 < n< 27) 


31 24 23 16 15 8 7 0 


The bitwise-logical OR between the src? and src2 operands is loaded into the 
dst register. The src, src2, and dst operands are assumed to be unsigned in- 
tegers. 


1 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 

N MSB of the output 

Z 1 if a O result is generated; 0 otherwise 
Vv 0 

Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 


Example OR3 


Cc 
Data memory 
809804h 


*++AR1(IR1) ,R2,R7 


Before Instruction 


00 1256 0000 
00 0000 0000 


80 9800 


| LMU 
| DO 


2BCD 


Bitwise-Logical OR, 3-Operand OR3 


After Instruction 


R2 00 1256 0000 
R7 0.1256 2BCD 
AR1 80 9804 


809804h 2BCD 


Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 


Assembly Language Instructions 


13-193 


OR3}||STI 


Syntax 


Operation 


Operands 


Opcode 


Cycles 


13-194 


Parallel OR3 and STI 


OR3 = src2, src1, dst? 
|| STI src3, dst2 


src? OR src2 > dst? 
| src3— dst2 


srct 
src2 
dst1 
src3 
dsi2 


register (Rn1,0< ni <7) 
indirect (disp = 0, 1, IRO, IR1) 
register (Rn2, 0 < n2 <7) 
register (Rn3, 0 < n3 <7) 
indirect (disp = 0, 1, IRO, IR1) 


L} 
_] 
_] 


This instruction’s operands have been augmented in the following devices: 


C31 silicon revision 6.0 or greater 
C82 silicon revision 2.0 or greater 
VC33 


src? register (Rn1,0<n1 <7) 
src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn2,0 < n2<7) 
src3_ register (Rn38, 0< n3 <7) 


3 


dsi2—indirect disp —0t IRORY 


8 7 0 


1 24 23 16 15 


A bitwise-logical OR and an integer store are performed in parallel. All registers 
are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (OR3) writes to the same register, then ST| 
accepts the contents of the register as input before it is modified by the OR3. 


If src2 and dst2 point to the same location, src2 is read before the write to dst2. 


{ 


Parallel OR3 and STI 


OR3||STI 


Status Bits These condition flags are modified only if the destination register is R7—RO. 
LUF Unaffected 
LV Unaffected 
UF 0 
N MSB of the output 
Z 1 if a O result is generated; 0 otherwise 
Vv 0 
Cc Unaffected 
Mode Bit OVM = Operation is not affected by OVM bit value. 
Example OR3 *+4+AR2,R5,R2 
| | STI R6, *AR1-— 
Before Instruction After Instruction 
R2 R2 
RS RS 
R6 220 R6 220 
ARI ARI 
AR2 AR2 
LurF [| LuF [_ 
w [9] w 9] 
ue [—— u [a] 
n [9] C7 
z C4] za] 
v C7 v 7] 
c To] clo] 
Data memory 
809831h 809831h 
go988sh  [—~—~—=iC 809883h ope] 220 
Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 


Assembly Language Instructions 


13-195 


POP Pop Integer 


Syntax 
Operation 
Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


POP ast 
*SP-- > dst 


dst register (Rn, 0 < n< 27) 


31 


24 23 16 8 0 


15 7 
0000000000000 000 


The top of the current system stack is popped and loaded into the dst register 
(32 LSBs). The top of the stack is assumed to be a signed integer. The POP 
is performed with a postdecrement of the stack pointer. The exponent bits of 
an extended-precision register (R7—RO) are left unmodified. 


{ 


These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 

Vv 0 

Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 


POP R3 


Before Instruction 


R3 00 0000 12DA 4,826 
SP 809856 


CL 
S| 
Lo 
N [Lo] 
Ll 
Ll 
[ol 


Data memory 


13-196 


g09856h -62,044 


R3 
SP 
LUF 
LV 
UF 


oOo < N Z 


809856h 


After Instruction 


OO FFFF ODA4} -62,044 
809855 


Lo 
bs _ = 
[| 
Lo 
Ld 
Lo 


FFFFODA4 | -62,044 


Syntax 
Operation 
Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


R4 
SP 


Data memory 
80984Ah 


Pop Floating-Point Value POPF 


POPF dst 
*SP--— dst? 


dst register (Rn, O< n<7) 


31 24 23 1615 87 0 
fo 0 of 01 110104, at |o000000000000000 


The top of the current system stack (32 MSBs) is popped and loaded into the 
dst register. The top of the stack is assumed to be a floating-point number. The 
POP is performed with a postdecrement of the stack pointer. The eight LSBs 
of an extended-precision register (R7—RO) are zero-filled. 


1 
These condition flags are modified only if the destination register is R7—RO0. 


LUF Unaffected 


UF 0 

LV Unaffected 

N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 

Vv 0 

Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


POPF R4 

Before Instruction After Instruction 
[02 sp2E 0123] 6.91186578e+00 R4 5.32544007e+28 
sP 
Lo wuF [| 
CC —_z3 wlio 
CF uF a] 
Lo] NCL] 
a] z[ 
[ v~L 
en) cL) 


5F2C1302| 5.32544007e+28 80984Ah 5F2C1302| 5.32544007e+28 


Assembly Language Instructions 13-197 


PUSH PUSH Integer 


Syntax 
Operation 
Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


13-198 


PUSH src 
src > *++SP 


src register (Rn, 0 < n< 27) 


31 24 23 1615 7 
fooofor 11001, se |oo00000000000000 


8 0 


The contents of the src register (32 LSBs) are pushed on the current system 
stack. The src is assumed to be a signed integer. The PUSH is performed with 
a preincrement of the stack pointer. The integer or mantissa portion of an ex- 
tended-precision register (R7—RO) is saved with this instruction. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 


N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


PUSH R6 


Before Instruction 


R6 02 5C12 8081] 633,415,688 R6 


sp sp 
we LUF 
vd Lv 
ue FJ UF 
, a] N 
z 9] z 
a] v 
¢ a c 


Data memory 


s098AFh [———i| -6 2, 044 8098AFh 


After Instruction 


02 5C12 8081 633,415,688 
8098AF 


UL 


5C128081) 1,544,716,417 


Syntax 
Operation 
Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


Data memory 
809802h 


PUSH Floating-Point Value PUSHF 


PUSHF src 
src > *++SP 


src register (Rn, 0< n<7) 


31 


24 23 1615 8 7 0 
fooolor ssi ios] ae |ooooo 00000000000 


The contents of the src register (82 MSBs) are pushed on the current system 
stack. The src is assumed to be a floating-point number. The PUSH is per- 
formed with a preincrement of the stack pointer. The eight LSBs of the mantis- 
sa are not saved. (Note the difference in R2 and the value on the stack in the 
example below.) 


LUF Unaffected 
LV Unaffected 


UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


PUSHF R2 
Before Instruction After Instruction 
R2 6.87725854e+00 R2 6.87725854e+00 
SP SP 
uF [ uuF[ 
wll wl 
UF [oe uF [ 
NL] eT) 
z{L_ z7L___ 
¥ = 4) vL_ 
e Lo] cL 
[i 809802h 6 .87725830e+00 


Assembly Language Instructions 13-199 


RETlcond = Return From Interrupt Conditionally 


Syntax 


Operation 


Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-200 


RETlcond 


If cond is true: 
*SP —--—+ PC 
1 — ST (GIE). 


Else, continue. 


None 


31 


24 23 16 15 87 0 
011411 000 0{00/ cmd |ooo00000000000000 


A conditional return is performed. If the condition is true, the top of the stack 
is popped to the PC, and a 1 is written to the global interrupt enable (GIE) bit 
of the status register. This has the effect of enabling all interrupts for which the 
corresponding interrupt enable bit is a 1. 


The ’C3x provides 20 condition codes that can be used with this instruction 
(see Table 13-12 on page 13-30 for a list of condition mnemonics, condition 
codes, and flags). Condition flags are set on a previous instruction only when 
the destination register is one of the extended-precision registers (R7—RO) or 
when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, 
or TSTB3) is executed. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 


N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Example RETINZ 

Before Instruction 

PC 

SP 

sé Ld 

LuF [| 

wv Lo] 

UF [ 

N [3] 
zo] 

v Lo 

c [ol 

Data memory 

809830h 123 


Return From Interrupt Conditionally RETIcond 


809830h 


Assembly Language Instructions 


After Instruction 
0123 
80982F 


2000 


WIN 


123 


13-201 


RETScond = Return From Subroutine Conditionally 


Syntax RETScond 
Operation If cond is true: 
*SP--— PC. 
Else, continue. 
Operands None 
Opcode 
31 2423 1615 87 0 
evr riooe oo] aw [vvoosoovwenveone 
Description A conditional return is performed. If the condition is true, the top of the stack 


is popped to the PC. 


The ’C8x provides 20 condition codes that you can use with this instruction 
(see Table 13-12 on page 13-30 for a list of condition mnemonics, condition 
codes, and flags). Condition flags are set on a previous instruction only when 
the destination register is one of the extended-precision registers (R7—RO) or 
when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI8, TSTB, 
or TSTB3) is executed. 


Cycles 4 
Status Bits LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
Vv Unaffected 
Cc Unaffected 
Mode Bit OVM = Operation is not affected by OVM bit value. 


13-202 


Return From Subroutine Conditionally RETScond 


Example RETSGE 

Before Instruction After Instruction 

PC PC 

SP SP 

lure (—i tur [i 

vw Lol wv Lol 

uF [__] uF [__o] 

n [9] , C7 

z C9] zc ___4] 
v9] vo 9 

c [Ll c Lol 

Data memory 

80983Ch 456 80983Ch 456 


Assembly Language Instructions 13-203 


RND Round Floating-Point Value 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-204 


RND src, dst 
rnd(src) > dst 
src general addressing modes (G): 


00 register (Rn, O<n<7) 

01 ~~ direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, O<n<7) 


31 24 23 1615 87 0 


The result of rounding the src operand is loaded into the dst register.The src 
operand is rounded to the nearest single-precision floating-point value. If the 
src operand is exactly halfway between two single-precision values, it is 
rounded to the most positive value. 


1 
These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 

LV 1 if a floating-point overflow occurs; unchanged otherwise 

UF 1 if a floating-point underflow occurs or the src operand is 0; 
0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z Unaffected 

Vv 1 if a floating-point overflow occurs; 0 otherwise 

Cc Unaffected 


OVM Operation is affected by OVM bit value. 


Example RND R5,R2 


Before Instruction 


Round Floating-Point Value RND 


After Instruction 


R2 07 33C1 6F00| 1.79755600e+02 


07 33Cl 6EEF|1.79755599e+02 R5 07 33Cl 6EEF] 1.79755599e+02 


LUF 
LV 
UF 


[ 
[ 
[Ld 
[ 
[_ 
[ 
[ 


QO << N Z 


R2 

R5 

LuF Li] 

w Ll 

uF [ 

N Lo] 
zCL | 

v Ld 

c Lo] 
Note: BZUF Instruction 


If a BZ instruction is executed immediately following an RND instruction with 
a 0 operand, the branch is not performed because the zero flag is not set. 
To circumvent this problem, execute a BZUF instruction instead of a BZ 
instruction. 


Assembly Language Instructions 13-205 


ROL Rotate Left 


Syntax 
Operation 
Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


13-206 


ROL ast 
dst left-rotated 1 bit > dst 


dst register (Rn, 0 < n< 27) 


31 8 


24 23 1615 7 0 
fooo|1000%14/11, ast |oo00000000000001 


The contents of the dst operand are left rotated one bit and loaded into the dst 
register. This is a circular rotation, with the MSB simultaneously transferred 
into the carry (C) bit and the LSB. 


Rotate left: 


Cé< dst < 
> 


1 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 

N MSB of the output 

Z 1 if a O output is generated; 0 otherwise 

Vv 0 

Cc Set to the value of the bit rotated out of the high-order bit; unaffected 


if dst is not R7 — RO 
OVM Operation is not affected by OVM bit value. 


ROL R3 
Before Instruction After Instruction 
R3 R3 
wuF [ uF [ 
vw C4 v CJ 
UP [ed UP [_ 
N [oo NL 
2 re) zL 
v (Lo v Cd 
cL c 


Syntax 
Operation 
Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 1 


Rotate Left Through Carry ROLC 


ROLC ast 
dst left-rotated one bit through carry bit + dst 


dst register (Rn, 0 < n< 27) 


31 8 


24 23 1615 “4 0 
fooo|10010013| at |0000000000000001 


The contents of the dst operand are left rotated one bit through the carry (C) 
bit and loaded into the dst register. The MSB is rotated to the carry bit at the 
same time the carry bit is transferred to the LSB. 


Rotate left through carry bit: 


dst < 
> 


{ 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 

N MSB of the output 

Z 1 if a 0 output is generated; 0 otherwise 

Vv 0 

Cc Set to the value of the bit rotated out of the high-order bit; if dst is not 


R7-RO, then C is shifted into the dst but not changed 
OVM Operation is not affected by OVM bit value. 


ROLC R3 
Before Instruction After Instruction 
R3 R3 
wuF [ol uF Ld 
UF [_ UF [ 
NL N Lol 
Z[__ a) [ol 
i C4 
C c [J] 


Assembly Language Instructions 13-207 


ROLC Rotate Left Through Carry 


Example 2 


R3 
LUF 
LV 
UF 


QO << N Z 


13-208 


ROLC R3 


Before Instruction 


00 8000 4281 


Oc N Z 


After Instruction 


00 0000 8502 


Syntax 
Operation 
Operands 
Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


Rotate Right ROR 


ROR dst 
dst right-rotated one bit through carry bit > dst 
dst register (Rn, 0 < n< 27) 


31 8 0 


24 23 1615 
fooolioorodrs| dt frraataaaa antag 


The contents of the dst operand are right rotated one bit and loaded into the 
dst register. The LSB is rotated into the carry (C) bit and also transferred into 
the MSB. 


N 


Rotate right: 


{ 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output 

Z 1 if a O output is generated; 0 otherwise 

Vv 0 

Cc Set to the value of the bit rotated out of the high-order bit; unaffected 
if dstis not R7—RO 


OVM Operation is not affected by OVM bit value. 


ROR R7 

Before Instruction After Instruction 
R7 R7 
uF [Ld uF Ld 
UF [ol UF [ 
N Lo N 

[4] z[ 
Vv Co vod] 
cL c 


Assembly Language Instructions 13-209 


RORC Rotate Right Through Carry 


Syntax 
Operation 
Operands 
Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


13-210 


RORC ast 
dst right-rotated one bit through carry bit > dst 
dst register (Rn, 0 < n< 27) 


31 8 7 0 


24 23 1615 
foool1oor+ol1s| at frtaaataaaaatagag 


The contents of the dst operand are right rotated one bit through the status reg- 
ister’s carry (C) bit. This could be viewed as a 33-bit shift. The carry bit value 
is rotated into the MSB of the dst, while at the same time the dsfLSB is rotated 
into the carry bit. 


Rotate right through carry bit: 


oo 


< 


{ 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 

N MSB of the output 

Z 1 if a O output is generated; 0 otherwise 

Vv 0 

Cc Set to the value of the bit rotated out of the high-order bit; if dstis not 


R7 — RO, then C is shifted in but not changed 
OVM Operation is not affected by OVM bit value. 


RORC R4 

Before Instruction After Instruction 
Ra Ra 
wuF [ wuF [ 
UF [_] UF [| 
N N [oo] 
z Lo] zL_ 
v C9] v C4] 
c Ll c 


Syntax 


Operation 


Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Repeat Block RPTB 


RPTB src 


src > RE 
1 > ST (RM) 
Next PC > RS 


src long-immediate addressing mode 


31 24 23 1615 8 7 0 


01100100 src 


RPTB allows a block of instructions to be repeated RC register + 1 times with- 
out any penalty for looping. This instruction activates the block repeat mode 
of updating the PC. The src operand is a 24-bit unsigned immediate value that 
is loaded into the repeat end-address (RE) register. A 1 is written into the re- 
peat mode bit of status register ST (RM) to indicate that the PC is being up- 
dated in the repeat mode. The address of the next instruction is loaded into the 
repeat start-address (RS) register. 


RE should be greater than or equal to RS (RE = RS). Otherwise, the code 
does not repeat, even though the RM bit remains set to 1. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-211 


RPTB Repeat Block 


Example 


13-212 


RPTB 127h 

Before Instruction After Instruction 
PC PC 
RE [__ RE 
rs Ld RS 
ST 4 ST 100 
uF [ wwF Ld 
vw fd w [4 
UP [ uP [ 
NL N [Td 
z[L zL 
v Cd vbL 
cL cL 


Because the block-repeat modes modify the program counter, no 
other instruction can modify the program counter at the same time. 
The following two rules apply: 


Rule 1: The last instruction in the block (or the only instruction in a 
block of size 1) cannot be a Bcond, BR, DBcond, CALL, 
CALLcond, TRAPcond, RETlcond, RETScond, IDLE, IDLE2, 
RPTB, or RPTS. Example 7-3 on page 7-6 shows an 


incorrectly placed standard branch. 


None of the last four instructions at the bottom of the block 
(or the only instruction in a block of size 1) can be a 
BcondD, BRD, or DBcondD. Example 7-4 on page 7-7 
shows an incorrectly placed delayed branch. 


If either rule is violated, the PC will be undefined. 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Repeat Single Instruction RPTS 


RPTS src 


src > RC 

1 —> ST (RM) 
13S 

Next PC — RS 
Next PC — RE 


src general addressing modes (G): 


00 register 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


31 24 23 1615 8 7 0 
fooolio0114/G |rs044 src 


The RPTS instruction allows you to repeat a single instruction src + 1 times 
without any penalty for looping. Fetches can also be made from the instruction 
register (IR), thus avoiding repeated memory access. 


The src operand is loaded into the repeat counter (RC). A 1 is written into the 
repeat mode bit of the status register ST (RM). A 1 is also written into the re- 
peat single bit (S). This indicates that the program fetches are to be performed 
only from the instruction register. The next PC is loaded into the repeat end-ad- 
dress (RE) register and the repeat start-address (RS) register. 


For the immediate mode, the src operand is assumed to be an unsigned inte- 
ger and is not sign extended. 


LUF Unaffected 
LV Unaffected 


UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-213 


RPTS Repeat Single Instruction 


Example RPTS AR5 

Before Instruction After Instruction 
ARS AR5 
PC PC 
| er RC 
RE. [| RE 
RS Lo RS 
ST [i ST 100 
Lur [___——i uur [9] 
UF [| UF Lo] 
NL ol N Lol 
zt] z Li] 
vf — __ — =] v Lo 
cL] c [| 


Because the block-repeat modes modify the program counter, no 
other instruction can modify the program counter at the same time. 
Therefore, the repeated instruction cannot be a Bcond, BR, 
DBcond, CALL, CALLcond, TRAPcond, RETIcond, RETScond, 


IDLE, IDLE2, RPTB, or RPTS. If this rule is violated, the PC will be 
undefined. 


Note: 


The RPTS instruction cannot be interrupted because instruction fetches are 
halted. 


a) 


13-214 


Syntax 


Operation 


Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


Signal, Interlocked SIGI 


SIGI 


Signal interlocked operation. 
Wait for interlock acknowledge. 
Clear interlock. 


None 


31 24 23 1615 OF 0 
101410 of 00| 0000000000000 00000000 


An interlocked operation is signaled over XFO and XF1. After the interlocked 
operation is acknowledged, the interlocked operation ends. SIGI ignores the 
external ready signals. Refer to Section 7.4, /nterlocked Operations, on page 
7-13 for detailed information. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


SIGI ; The processor sets XFO to 0, idles 
; until XF1 is set to 0, and then 
; sets XFO to l. 


Assembly Language Instructions 13-215 


STF Store Floating-Point Value 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


13-216 


R2 
DP 
LUF 
LV 
UF 


Cc 
Data memory 
8098A1h 


STF src, dst 

src > dst 

src register (Rn, 0 < n< 7) 

dst general addressing modes (G): 


01 direct 
10 _ indirect (disp = 0-255, IRO, IR1) 


31 2423 16 15 8 7 0 


The src register is loaded into the dst memory location. The src and dst oper- 
ands are assumed to be floating-point numbers. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 


N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


STF R2,@98Al1h 


Before Instruction After Instruction 
4..30782204e+01 R2 [05 2c50 1900] 4.30782204e+01 
DP 080 
Ll Lur [_ 
ee: w Loo 
—er ur [a 
es NT] 
Lo] z{L 
[Ld v Lo 
a) cL 
[i go9sAth [__52C5019] 4.30782204e+01 


Store Floating-Point Value, Interlocked STF 


Syntax STFI src, dst 


Operation src > dst 
Signal end of interlocked operation. 


Operands src register (Rn, 0< n<7) 
dst general addressing modes (G): 


01 direct 
10 _ indirect (disp = 0-255, IRO, IR1) 


Opcode 
31 24 23 16 15 87 0 
pooper wore) : 

Description The src register is loaded into the dst memory location. An interlocked opera- 
tion is signaled over pins XFO and XF1. The src and dstoperands are assumed 
to be floating-point numbers. Refer to Section 7.4, Interlocked Operations, on 
page 7-13 for detailed information. 

Cycles 1 

Status Bits LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 

Mode Bit OVM Operation is not affected by OVM bit value. 

Example STFI R3,*-AR4 

Before Instruction After Instruction 
R3 1.79750e+02 R3 1.79750e+02 
AR4 AR4 
Lue [CY LuF Lo] 
vt vt 
uF L___ uF [Lo 
N [ol N[ 
zd z— id 
vod vo] 
cL cL] 
Data memory 
gos9sBh [CY 80993Bh 1.79750e+02 


Assembly Language Instructions 13-217 


STFI Store Floating-Point Value, Interlocked 


aaa. | 
Note: 


The STFI instruction is not interruptible because it completes when ready is 


signaled. See Section 7.4, Interlocked Operations, on page 7-13. 
| 


13-218 


Parallel Store Floating-Point Value STF||STF 


Syntax STF src2, dst2 

|| STF srct, dst? 
Operation src2 > dst2 

|| src1 — dst1 
Operands src? register (Rn1,0<n1 <7) 


dst? indirect 
src2__ register 
dst2 indirect 


— 


disp = 0, 1, IRO, IR1) 
Rn2, 0 < n2<7) 
disp = 0, 1, IRO, IR1) 


—a~ —~ 


This instruction’s operands have been augmented on the following devices: 


J ’C31 silicon revision 6.0 or greater 
Lj ’C2 silicon revision 2.0 or greater 
L) ’VC33 


src1_ register (Rn1i, 0 < n1 <7) 

dst1 indirect (disp = 0, 1, IRO, IR1) 

src2_ register (Rn2,0 < n2<7) 

dst2— indirect (disp = 0, + tRORT) or any GPU register 


Opcode 
31 24 23 1615 87 0 
ecwed salves] ar 
Description Two STF instructions are executed in parallel. Both src7 and src2 are assumed 
to be floating-point numbers. 
Cycles 1 
Status Bits LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
Vv Unaffected 
Cc Unaffected 
Mode Bit OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-219 


STF||STF Parallel Store Floating-Point Value 


Example STF R4, *AR3-- 
| | STF R3,*++AR5 
Before Instruction After Instruction 
R3 [07 33c0 0000] 1.79750e+02 R3 [07 33co 0000] 1.79750e+02 
R4 [07 0c8o 0000] 1.4050e+02 R4 [07 ocg0 0000] 1.4050e+02 
AR3 AR3 
ARS ARS 
wr Ll wuF [| 
vw [9] w [9] 
uP [—— ue 
Nn [9] n [9] 
z Lo 2 ee | 
v LJ v Lo] 
c Lo] ec [ol 
Data memory 
gogs3sh [| ——i 809835h [__070c8000] 1.4050e+02 
gosep3h_ [oj sogeD3h [____0733c000] 1.79750e+02 


Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 


13-220 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


Store Integer STI 


STI src, dst 

src > dst 

src register (Rn, 0 < n< 27) 

dst general addressing modes (G): 


01 direct 
10 _ indirect (disp = 0-255, IRO, IR1) 


31 24 23 16 15 87 0 


The src register is loaded into the dst memory location. The src and dst oper- 
ands are assumed to be signed integers. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 


STI R4,@982Bh 


Before Instruction After Instruction 
R4 [00 0004 2BD7] 273,367 R4 273,367 
DP 080 DP 080 
LuE [CO] Lur Lo 
vw [ol v Ld 
uF [ UF [J 
N [Oo] N[ 
A | zCL 
v 0] v [] 
c Lo] c [ol 


Data memory 


80982Bh 58,876  80982Bh 273,367 


Assembly Language Instructions 13-221 


STII Store Integer, interlocked 


Syntax STII src, dst 


Operation src > dst 
Signal end of interlocked operation 


Operands src register (Rn, 0 < n< 27) 


dst general addressing modes (G): 
01 direct 
10 _ indirect (disp = 0-255, IRO, IR1) 


Opcode 
31 2423 1615 87 0 
poorer el |e 
Description The src register is loaded into the dst memory location. An interlocked opera- 
tion is signaled over pins XFO and XF1. The src and dst operands are assumed 
to be signed integers. Refer to Section 7.4, Interlocked Operations, on page 
7-13 for detailed information. 
Cycles 1 
Status Bits LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
Vv Unaffected 
Cc Unaffected 
Mode Bit OVM = Operation is not affected by OVM bit value. 
Example STII R1,@98AEh 
Before Instruction After Instruction 
RL RL 
DP DP 080 
Data memory 
8098AEh 25C 8098AEh 78D 
Note: 


The STII instruction is not interruptible because it completes when ready is 
signaled. See Section 7.4, Interlocked Operations, on page 7-13. 


13-222 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Parallel STI and ST! STI||STI 


STI src2, dsi2 
|| STI src?, dst? 


src2 > dst2 
|| srct1 > dst1 


src1_ register 
dst? indirect 
src2_ register 
dst2 indirect 


Ri, 0<n1 <7) 
disp = 0, 1, IRO, IR1) 
Rn2, 0 < n2<7) 
disp = 0, 1, IRO, IR1) 


~~ 


This instruction’s operands have been augmented on the following devices: 


J ’C31 silicon revision 6.0 or greater 
_j ’C32 silicon revision 2.0 or greater 
L) ’VC33 


src1_ register (Rni, 0 < n1 <7) 

dst? indirect (disp = 0, 1, IRO, IR1) 

src2_ register (Rn2,0< n2<7) 

ast2—indirect (aisp = 6, t+ tRO RT) or any CPU Tegister 


31 24 23 1615 8 7 0 
000 01 | 2 [000 srct dst1 dst2 


Two integer stores are performed in parallel. If both stores are executed to the 
same address, the value written is that of STI src2, dsi2. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Assembly Language Instructions 13-223 


STI||STI 


Example 


Parallel STI and STI 


STI RO,*++AR2 (IRO) 


STI R5,*ARO 


Before Instruction 


After Instruction 


RO 220 RO 220 
R5 53 R5 53 
ARO ARO 
AR2 AR2 
IRO IRO 
LUE [| LUE [_] 
w [9] w L___9 
uP up [9] 
N [Lo] N [Lo] 
z Lo] z Lo] 
v Lo] v Lo] 
c Lol Cc Lo] 
Data memory 
goge3sh  [— 809838h onc] 220 
gogsDsh [| t—~—SC 8098D3h 53 
Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 


13-224 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


Subtract Integer With Borrow SUBB 


SUBB src, dst 
dst- src-C — dst 
src general addressing modes (G): 


00 register (Rn, 0 < n< 27) 

01 direct 

10 indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, O< n< 27) 


31 24 23 1615 87 0 


The difference of the dst, src, and C operands is loaded into the dst register. 
The dst and src operands are assumed to be signed integers. 


{ 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 

LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a borrow occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 


SUBB *AR5++(4),R5 


Before Instruction After Instruction 
R5 250 RS 50 
ARS ARS 
Lue (CY Lure [| 
i wv Lol 
uF L___ | UF Lo] 
N [Lo] N [oo] 
z C9] zC__ 
vo 4 v LJ] 
C c Lol 
Data memory 
809800h oc7| 199 809800h oc7} 199 


Assembly Language Instructions 13-225 


SUBB3 Subtract Integer With Borrow, 3-Operand 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-226 


SUBB3 src2, src1, dst 
src1 — src2- C > dst 
src? 3-operand addressing modes (T): 


00 _ register 
01 indirect 
register 
indirect 


Rni, 0 <n < 27) 
disp = 0, 1, IRO, IR1) 
Rni,0< ni <27) 
disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (T): 


00 _~ register 
01 register 
indirect 
indirect 


Rn2, 0 < n2 < 27) 
Rn2, 0 < n2 < 27) 
disp = 0, 1, IRO, IR1) 
disp = 0, 1, IRO, IR1) 


dst register (Rn, 0 < n< 27) 


31 24 23 1615 87 0 


The difference among the src? and src2 operands and the C flag is loaded into 
the dst register. The src7, src2, and dst operands are assumed to be signed 
integers. 


1 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a borrow occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 


Subtract Integer With Borrow, 3-Operand SUBB3 


Example SUBB3 R5,*AR5++(IRO) ,RO 

Before Instruction After Instruction 
RO RO 50 
RS 199 RS 199 

ARS ARS 

RO [OO ] Ro [al 

tur (~-_______] wwF [ 

vw — 9] w a] 

uF u (i 

Nn Cg nC 9 

z Lo z{[ 

v Lo] v Lol 

c cL 

Data memory 

809800h 250 809800h 250 


Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 


Assembly Language Instructions 13-227 


SUBC = Subtract Integer Conditionally 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


13-228 


SUBC src, dst 


If (dst — src= 0): 

(dst- src << 1) OR1 — ast 
Else: 

dst << 1 > dst 


src general addressing modes (G): 


00 register (Rn, O< n< 27) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 < n< 27) 


31 24 23 16 15 87 0 


The src operand is subtracted from the dst operand. The dst operand is loaded 
with a value dependent on the result of the subtraction. If (dst— src) is greater 
than or equal to 0, then (dst — src) is left-shifted one bit, the least significant 
bit is set to 1, and the result is loaded into the dst register. If (dst — src) is less 
than 0, dstis left-shifted one bit and loaded into the dst register. The dst and 
src operands are assumed to be unsigned integers. 


You can use SUBC to perform a single step of a multi-bit integer division. See 
the TMS320C3x General Purpose Applications Guide for a detailed descrip- 
tion. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 


N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM = Operation is not affected by OVM bit value. 


Subtract Integer Conditionally SUBC 


Example 1 SUBC @98C5h,R1 
Before Instruction After Instruction 
Rt [__00 0000 o4Fe] 1270 Rt 201 
DP 080 DP 080 
Lue [CO Lue [___ 
w [ol w Lo 
uF Ea] uF [ 
n [9 i ) 
z C9 zC 9 
i vo 9 
cL c [To 
Data memory 
8098C5h 492] 1170  8098C5h 1170 
Example 2 SUBC 3000,RO (3000 = OBB8h) 
Before Instruction After Instruction 
RO 2000 RO 4000 
uF [_ uF [_ 
UF [ Ue [= a] 
N Lo N Lol 
z C4] zc 9] 
v C9] vod 
cf oj cf 9] 


Assembly Language Instructions 13-229 


SUBF = Subtract Floating-Point Value 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


13-230 


SUBF src, dst 
dst — src > dst 


src general addressing modes (G): 


00 register (Rn,O<n<7) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 < n< 7) 


31 24 23 1615 87 0 


The difference between the dst operand and the src operand is loaded into the 
dst register. The dst and src operands are assumed to be floating-point num- 
bers. 


{ 


These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if a floating-point overflow occurs; 0 otherwise 


Cc Unaffected 
OVM = Operation is not affected by OVM bit value. 


Subtract Floating-Point Value SUBF 


Example SUBF *ARO-- (IRO),R5 

Before Instruction After Instruction 
R5 1.79750000e+02 R5 3.9250e+01 

ARO ARO 

IRO IRO 

wr [_ i 

w Lo wf 

UF [__ UP Ld 

N CL N[ 

zt zL_ 

v [ol v Lo] 

¢ Lo] ¢ Lo 

Data memory 

809888h 1.4050e+02 809888h 1.4050e+02 


Assembly Language Instructions 13-231 


SUBF3 Subtract Floating-Point Value, 3-Operand 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


13-232 


SUBF3 src2, src1, dst 
src? — src2 > dst 


src? 3-operand addressing modes (T): 


00 register (Rni,O<n1 <7) 

01 indirect (disp = 0, 1, IRO, IR1) 
register (Rn1, 0 < n1 <7) 
indirect (disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (T): 


00 register (Rn2,0< n2<7) 

01 register (Rn2, 0 < n2 <7) 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, IRO, IR1) 


dst register (Rn, 0 < n< 7) 


31 24 23 1615 87 0 


The difference between the src? and src2 operands is loaded into the dst reg- 
ister. The src7, src2, and dst operands are assumed to be floating-point num- 
bers. 


{ 


These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if a floating-point overflow occurs; 0 otherwise 


Cc Unaffected 
OVM = Operation is not affected by OVM bit value. 


Example 1 


Example 2 


Subtract Floating-Point Value, 3-Operand SUBF3 


SUBF3 *ARO-- (IRO) , *AR1,R4 


Before Instruction 


Cc 


Data memory 


809888h [__70c8000] 1.4050e+02 
809851h 1.79750e+02 


SUBF3 R7,RO,R6 


Before Instruction 


UF 


ao -<=— MN Zz 


809888h 
809851h 


After Instruction 

05 1D00 0000| 3,.9250e+01 
80 9808 
80 9851 


INN 


7OC8000} 1.4050e+02 
733C000| 1.79750e+02 


After Instruction 


RO 03 4C20 0000] 1.27578125e+01 
R6 05 B7C8 0000}-5.00546875e+01 
R7 05 _7B40 0000] 6.281250e+01 


QO < N Z 


HILL 


Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 


Assembly Language Instructions 13-233 


SUBF3||STF Parallel SUBF3 and STF 


Syntax SUBF3 src1, src2, dst1 
|| STF src3, dst2 

Operation src2— src1 > dst1 
|| src3 > dsi2 

Operands src? register (Rn1,0<n1 <7) 
src2__ indirect (disp = 0, 1, IRO, IR1) 


dst? register (Rn2, 0 < n2<7) 
src3_ register (Rn8, 0 < n3 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


This instruction’s operands have been augmented in the following devices: 


_j ’C31 silicon revision 6.0 or greater 
_j ’C32 silicon revision 2.0 or greater 
Lj ‘VC33 


src? register (Rn1,0<n1 <7) 

src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn2,0 < n2<7) 

src3_ register (Rn8, 0 < n3 <7) 

dsi2—indireet {disp =, 4+ IROARH 


31 24 23 1615 87 0 


Description A floating-point subtraction and a floating-point store are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. If one of the parallel operations (STF) reads from a register and the oper- 
ation being performed in parallel (SUBF3) writes to the same register, STF ac- 
cepts the contents of the register as input before it is modified by the SUBF3. 


Opcode 


If src3 and dst7 point to the same location, src3 is read before the write to dst7. 


Cycles 1 
Status Bits These condition flags are modified only if the destination register is R7—RO. 
LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 
N 1 if a negative result is generated; 0 otherwise 
Z 1 if a O result is generated; 0 otherwise 
Vv 1 if a floating-point overflow occurs; 0 otherwise 
Cc Unaffected 
Mode Bit OVM = Operation is not affected by OVM bit value. 


13-234 


Example 


Cc 

Data memory 
8098B0h 
809860h 


SUBF3 
| | ST 


ny 


Before Instruction 


70C8000}1.4050e+02 


Note: Cycle Count 


Parallel SUBF3 and STF_~SUBF3||STF 


R1,*-AR4 (IR1) ,RO 
R7,*+AR5 (IRO) 


RO 06 1B60 0000] 7.768750e+01 


R1 05 7B40 0000| 6.28125e+01 
R7 07_33C0 0000] 1.79750e+02 
80 98B8 
80 9850 


IRO 0 


8098B0h 1.4050e+02 


809860h 


70C8000 


733C000 1.79750e+02 


MITT 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 


Assembly Language Instructions 13-235 


SUBI = Subtract Integer 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


13-236 


SUBI src, dst 
dst - src > dst 
src general addressing modes (G): 


00 register (Rn, O< n< 27) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0 < n< 27) 


31 24 23 1615 87 0 

eofrrecede] a : 

The difference between the dst operand and the src operand is loaded into the 
dst register. The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a borrow occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 


SUBI 220,R7 


Before Instruction After Instruction 
R7 550 R7 330 
uF [ WF [od 
v Lg w Lol 
UP [_ uF [| 
N [oO NL 
z Li z {ol 
v C9] vy Lo] 
cL c [To] 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Subtract Integer, 3-Operand SUBI3 


SUBI3 src2, src1, dst 
src? — src2 > dst 
src? 3-operand addressing modes (T): 


00 register (Rn1, 0 <n1 < 27) 
01 _ indirect (disp = 0, 1, IRO, IR1) 
10. register (Rn1,0<1n1 < 27) 

1 indirect (disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (T): 


00 register (Rn2, 0 < n2< 27) 

01 register (Rn2,0< n2< 27) 

10 _ indirect (disp = 0, 1, IRO, IR1) 
1 indirect (disp = 0, 1, IRO, IR1) 


dst register (Rn, 0 < n< 27) 


31 24 23 1615 87 0 


The difference between the src? operand and the src2 operand is loaded into 
the dst register. The src7, src2, and dst operands are assumed to be signed 
integers. 


{ 
These condition flags are modified only if the destination register is R7—-RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a borrow occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 
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SUBI3 = Subtract Integer, 3-Operand 


Example 1 SUBI3 R7,R2,RO0 
Before Instruction After Instruction 
RO RO 50 
R2 2150 R2 2150 
R7 2100 R7 2100 
wr [_ LF Ld 
2 eT wv Lo 
ue (a ue [Si 
N C7 N 
z Cd zC 9 
v Lo v Lol 
rl i a] 
Example 2 SUBI3 *-AR2(1),R4,R3 
Before Instruction After Instruction 
R3 R3 330 
Ra 550 Rd 550 
AR2 AR2 
WWF [ wuF [| 
v Lol wv [oo] 
UPL UF [I 
N [0] N Lo] 
Zz CL) z [TL 
v Lo) v Lo] 
ec [ol G. [= — = 4] 
Data memory 
80985Dh ODC] 220 80985Dh ODC}] 220 
Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 


of operand ordering on the cycle count. 
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Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Parallel SUBI3 and ST! SUBI3||STI 


SUBI3_ srci, src2, dst1 
|| STI src3, dst2 


src2 — src1 > dst1 
l| src3— dst2 


src1_ register (Rn1,0<n1 <7) 
src2__ indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn2, 0 < n2 <7) 
src3_ register (Rn8, 0< n3 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


This instruction’s operands have been augmented in the following devices: 


_j ’C31 silicon revision 6.0 or greater 
J ’C2 silicon revision 2.0 or greater 
LJ ’VC33 


src? register (Rn1,0<n1 <7) 

src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn2, 0 < n2 <7) 

src3_ register (Rn3, 0 < n3 <7) 

dsi2—indirect_{disp= 6+ tROARH, 


31 24 23 1615 8 7 0 


An integer subtraction and an integer store are performed in parallel. All regis- 
ters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (SUBI3) writes to the same register, STI 
accepts the contents of the register as input before it is modified by the SUBIS. 


If src3 and dst7 point to the same location, src3 is read before the write to dst7. 
1 

These condition flags are modified only if the destination register is R7—RO. 
LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a borrow occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 
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SUBI3||STI = Paralle! SUBI3 and STI 
Example SUBI3 R7,*+AR2(IRO),R1 
| | STI R3,*++AR7 
Before Instruction After Instruction 
Rt Rt 200 
R3 53 R3 53 
RT 20 R7 i 
AR2 AR2 
AR7 ART 
IRO IRO 
uF uF Ci 
UP [ UF [__ 
vn [9] nC 9 
z Ld z CL 
v CJ v C7 
c Cd cfd 
Data memory 

80983Fh oDCc| 220 80983Fh ODC] 220 

gogssch [si 80983Ch 53 
Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 


of operand ordering on the cycle count. 
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Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 


Example 


Subtract Reverse Integer With Borrow SUBRB 


SUBRB src, dst 
src — dst- C > dst 
src general addressing modes (G): 


00 register (Rn, O<n< 27) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, O< n< 27) 


31 2423 1615 87 0 


The difference of the src, dst, and C operands is loaded into the dst register. 
The dst and src operands are assumed to be signed integers. 


{ 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a borrow occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 


SUBRB R4,R6 


Before Instruction After Instruction 
R4 971 RA 971 
R6 600 R6 370 
lure [CO LurF [__] 
vw [ol vw Lol 
UF Lo UF [___ 
N [Lol N Lo] 
z C9] a) 
i C7 
c c Lo) 


Assembly Language Instructions 13-241 


SUBRF Subtract Reverse Floating-Point Value 


Syntax 
Operation 
Operands 
Opcode 
Description 
Cycles 
Status Bits 
Mode Bit 
Example 
R5 
DP 
LUF 
LV 
UF 
N 
Z 
Vv 
Cc 
Data memory 
809905h 
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SUBRF src, dst 
src — dst > dst 
src general addressing modes (G): 


00 register (Rn, 0 <n<7) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, 0< n< 7) 


31 15 8 7 


24 23 16 0 


The difference between the src operand and the dst operand is loaded into the 
dst register. The dst and src operands are assumed to be floating-point num- 
bers. 


{ 


These condition flags are modified only if the destination register is R7—RO. 


LUF (1 ifa floating-point underflow occurs; unchanged otherwise 
LV 1 if a floating-point overflow occurs; unchanged otherwise 
UF 1 if a floating-point underflow occurs; 0 otherwise 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

Vv 1 if a floating-point overflow occurs; 0 otherwise 


Cc Unaffected 
OVM Operation is not affected by OVM bit value. 
SUBRF @9905h,R5 


Before Instruction After Instruction 

6.281250e+01 R5 1.16937500e+02 
080 DP 080 

[ae LuF [] 

[ w Lo] 

[Ld UF [| 

[Lo] N Lo] 

eee z Lo 

L ¥ La) 

Ld c Lo] 


733CO000} 1.79750e+02 809905h 733C000| 1.79750e+02 


Syntax 
Operation 
Operands 


Opcode 


Description 


Cycles 
Status Bits 


Mode Bit 
Example 


Subtract Reverse Integer SUBRI 


SUBRI src, dst 
src — dst > dst 
src general addressing modes (G): 


00 register (Rn, O<n< 27) 

01 direct 

10 _ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, O< n< 27) 


31 24.23 1615 8 0 


The difference between the src operand and the dst operand is loaded into the 
dst register. The dst and src operands are assumed to be signed integers. 


1 


N 


These condition flags are modified only if the destination register is R7—RO. 
LUF Unaffected 


LV 1 if an integer overflow occurs; unchanged otherwise 
UF 0 

N 1 if a negative result is generated; 0 otherwise 

Z 1 if a O result is generated; 0 otherwise 

V 1 if an integer overflow occurs; 0 otherwise 

Cc 1 if a borrow occurs; 0 otherwise 


OVM Operation is affected by OVM bit value. 
SUBRI *AR5++(IRO) ,R3 


Data memory 


Before Instruction After Instruction 

R3 220 R3 330 
ARS ARS 
IRO IRO 
lwuF [_ uF [_ 
vw [oo vw Lo 
UF [ol UP [od 
N C9] vcd 
zt | zC J 
v Ll v Ll 
c Ld c Lo 

809900h 226| 550 809900h 226} 550 
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SWI Software Interrupt 


Syntax 
Operation 
Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


13-244 


SWI 
Performs an emulation interrupt 


None 


31 2423 1615 8 7 0 
011001t10000;j0000000000000000000 00 


The SWI instruction performs an emulator interrupt. This is a reserved instruc- 
tion and should not be used in normal programming. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Z Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Syntax 


Operation 


Operands 
Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Trap Conditionally TRAPcond 


TRAPcond N 


0 > ST(GIE) 

If cond is true: 
Next PC —> *++SP, 
Trap vector N > PC. 


Else: 


Set ST(GIE) to original state. 
Continue. 


N(O<N¥31) 


31 2423 1615 87 0 
011101 0/0000| cn |oo000000001 N+ 20h 


Interrupts are disabled globally when 0 is written to ST(GIE). If the condition 
is true, the contents of the PC are pushed onto the system stack, and the PC 
is loaded with the contents of the specified trap vector (N). If the condition is 
not true, ST(GIE) is set to its value before the TRAP cond instruction changes 
it. 


The ’C8x provides 20 condition codes that can be used with this instruction 
(see Table 13-12 on page 13-30 for a list of condition mnemonics, condition 
codes, and flags). Condition flags are set on a previous instruction only when 
the destination register is one of the extended-precision registers (R7—RO) or 
when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, 
or TSTB3) is executed. 


LUF Unaffected 
LV Unaffected 
UF Unaffected 
N Unaffected 
Zz Unaffected 
V Unaffected 
Cc Unaffected 


OVM Operation is not affected by OVM bit value. 
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TRAPcond Trap Conditionally 


Example TRAPZ 16 

Before Instruction 

PC 

SP 

sé Ld 

wF Lo] 

vw Lol 

UF Lo] 

N 19] 

zC 9 

v Loo] 

c Lo) 

Data memory 

Trap V.16 
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PC 
SP 
ST 
LUF 
LV 
UF 


oOo <= N Zz 


809871h 


After Instruction 


0010 
809871 


Lo 
——— | 
[ 
[ 
a) 
Lo 
Lo 
Lo 


124 


Test Bit Fields TSTB 


Syntax TSTB src, dst 
Operation dst AND src 
Operands src general addressing modes (G): 


00 register (Rn, 0 <n< 27) 

01 direct 

10 indirect (disp = 0-255, IRO, IR1) 
11. immediate 


dst register (Rn, 0 < n< 27) 


Opcode 
31 24 23 1615 87 0 
ooorroroofs| a | ve 

Description The bitwise-logical AND of the dst and src operands is formed, but the result 
is not loaded in any register. This allows for nondestructive compares. The dst 
and src operands are assumed to be unsigned integers. 

Cycles 1 

Status Bits These condition flags are modified for all destination registers (R27-RO). 
LUF Unaffected 
LV Unaffected 
UF 0 
N MSB of the output 
Z 1 if a 0 output is generated; 0 otherwise 
Vv 0 
Cc Unaffected 

Mode Bit OVM Operation is not affected by OVM bit value. 
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TSTB Test Bit Fields 


Example TSTB *-AR4(1),R5 


Cc 
Data memory 
8099C4h 
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Before Instruction 


00 0000 0898] 2200 


80 99C5 


Ll 
eT) 
Ld 
Lo 
CL 
eT) 
Lo 


767 


1895 


R5 
AR4 
LUF 

LV 

UF 


oOo = N 2 


8099C4h 


After Instruction 


00 0000 0898} 2200 


[ 
a 
[ 
[ 
[Ld 
[ 


767 


1895 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


TSTB3 src2, src1 


src? AND src2 


src? 3-operand addressing modes (T): 


00 
01 
10 

1 


register 
indirect 
register 
indirect 


Rni,0< ni <27) 
disp = 0, 1, IRO, IR1) 
Rni,0< ni <27) 
disp = 0, 1, IRO, IR1) 


~~ ~~ 


src2 3-operand addressing modes (T): 


31 


24 23 16 
01 001111| tlo0000 


register (Rn2, 0 < n2 < 27) 
register (Rn2, 0 < n2 < 127) 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, IRO, IR1) 


15 


Test Bit Fields, 3-Operand TSTB3 


0 


The bitwise-logical AND between the src7 and src2 operands is formed but is 
not loaded into any register. This allows for nondestructive compares. The 
src? and src2 operands are assumed to be unsigned integers. Although this 
instruction has only two operands, it is designated as a 3-operand instruction 


because operands are specified in the 3-operand format. 


1 


These condition flags are modified for all destination registers (R27-RO). 


LUF 
LV 


Unaffected 
Unaffected 


MSB of the output 
1 if a O output is generated; 0 otherwise 


Unaffected 
Operation is not affected by OVM bit value. 
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TSTB3 Test Bit Fields, 3-Operand 


Example 1 TSTB3 *AR5-- (IRO) , *+ARO (1) 

Before Instruction After Instruction 

ARO ARO 

ARS ARS 

IRO IRO 

LuF [oo LuF [Lo] 

1) er vw Lo] 

UF [ ve [a] 

N 9] N C___] 

2 = — = Zz 

ya v Lo] 

c fd] ¢ [>] 

Data memory 
809885h a98] 2200 809885h 2200 
80992Dh 1895 80992Dh 767] 1895 
Example 2 TSTB3 R4,*AR6-- (IRO) 

Before Instruction After Instruction 

R4 R4 

AR6 ARG 

IRO IRO 

os | LuF [_] 

vw Lo] vw Lo] 

UF Lo UF [| 

N [3] nC gj 

zo 9 2 i _——— 4 

v Lo] v Lo] 

c Tol oe ey 

Data memory 
8099F8h 8099F8h 
Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 


of operand ordering on the cycle count. 
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Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 


Example 


Bitwise-Exclusive OR XOR 


XOR src, dst 
dst XOR src > dst 
src general addressing modes (G): 


00 register (Rn, O< n< 27) 

01 direct 

10 __ indirect (disp = 0-255, IRO, IR1) 
11 immediate 


dst register (Rn, O< n< 27) 


31 2423 1615 87 0 


The bitwise-exclusive OR of the src and dst operands is loaded into the dst 
register. The dst and src operands are assumed to be unsigned integers. 


{ 
These condition flags are modified only if the destination register is R7—-RO. 


LUF Unaffected 

LV Unaffected 

UF 0 

N MSB of the output 

Z 1 if a O output is generated; 0 otherwise 
Vv 0 

Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


XOR R1,R2 

Before Instruction After Instruction 
Rt Rt 
R2 R2 
lure [CO Lue [a 
w [ol w Loo 
UF [Od UF [ 
N Lo] N Co 
z C9 a) 
i Cd 
c Lo c Lo 
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XOR3 | Bitwise-Exclusive OR, 3-Operand 


Syntax 
Operation 


Operands 


Opcode 


Description 


Cycles 


Status Bits 


Mode Bit 
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XOR3 src2, src1, dst 
src1 XOR src2 > dst 
src? 3-operand addressing modes (T): 


00 _~ register 
01 _ indirect 
register 
indirect 


Rni,0< ni <27) 
disp = 0, 1, IRO, IR1) 
Rni, 0 <n < 27) 
disp = 0, 1, IRO, IR1) 


src2 3-operand addressing modes (T): 


00 register (Rn2, 0 < n2 < 27) 
01 register (Rn2, 0 < n2 < 27) 
indirect (disp = 0, 1, IRO, IR1) 


—_~oa~ 


indirect (disp = 0, 1, IRO, IR1) 


dst register (Rn, 0 < n< 27) 


31 24 23 1615 87 0 


The bitwise-exclusive OR between the src? and src2 operands is loaded into 
the dst register. The src1, src2, and dst operands are assumed to be unsigned 
integers. 


1 
These condition flags are modified only if the destination register is R7—RO. 


LUF Unaffected 
LV Unaffected 


UF 0 

N MSB of the output 

Z 1 if a O output is generated; 0 otherwise 
Vv 0 

Cc Unaffected 


OVM Operation is not affected by OVM bit value. 


Example 1 XOR3 *AR3++(IRO),R7,R4 
Before Instruction After Instruction 
Rd Rd 
R7 R7 
AR3 AR3 
IRO IRO 
uF [| LuF [| 
wv [ol w [oo 
u u (sd 
NC nC dj 
z{[ z{[ 
7 v9 
cL cL 
Data memory 
809800h 809800h 
Example 2 XOR3 R5,*-AR1(1),R1 
Before Instruction After Instruction 
R1 R1 
R5 R5 
ARI ARI 
LurF [ LurF [_ 
wv [ol w [oo 
UF [_] UF [ 
N C9] nC 
zc zd] 
v Lo v Lol 
c Ld cfd 
Data memory 

809825h 809825h 

Note: Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 


Bitwise-Exclusive OR, 3-Operand XOR3 


of operand ordering on the cycle count. 
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XOR3||STI 


Syntax 


Operation 


Operands 


Opcode 


Description 


Cycles 
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Parallel XOR3 and STI 


XOR3 ~ src2, src1, dst? 
|| STI src3, dst2 


src? XOR src2 > dst1 
|| src3 > dsi2 


src? register (Rn1,0<n1 <7) 
src2__ indirect (disp = 0, 1, IRO, IR1) 
dst? register (Rn2, 0 < n2 <7) 
src3_ register (Rn8, 0 < n3 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 


This instruction’s operands have been augmented in the following devices: 


(J ’C31 silicon revision 6.0 or greater 
(J ’C32 silicon revision 2.0 or greater 
Lj ‘VC33 


src? register (Rn1,0<n1 <7) 

src2__ indirect (disp = 0, 1, IRO, IR1) or any CPU register 
dst? register (Rn2,0 < n2<7) 

src3_ register (Rn8, 0< n3 <7) 

dsi2—indireet disp—0,4 IROARH 


31 2423 1615 8 7 0 


A bitwise-exclusive XOR and an integer store are performed in parallel. All reg- 
isters are read at the beginning and loaded at the end of the execute cycle. If 
one of the parallel operations (STI) reads from a register and the operation be- 
ing performed in parallel (XOR3) writes to the same register, STI accepts as 
input the contents of the register before it is modified by the XOR3. 


If src2 and dst2 point to the same location, src2 is read before the write to dst2. 


{ 


Status Bits 


Mode Bit 


Example 


Parallel XOR3 and STI 


XOR3||STI 


These condition flags are modified only if the destination register is R7—RO. 


LUF 
LV 
UF 
N 

Z 

V 

Cc 


Unaffected 
Unaffected 
0 


MSB of the output 


1 if a 0 output is generated; 0 otherwise 


0 
Unaffected 


OVM Operation is not affected by OVM bit value. 


Data memory 
80987Eh 
8098ACh 


XOR3 
STI 


Before Instruction 


*AR1++,R3,R3 
R6,*-AR2 (IRO) 


R3 
220 R6 
AR1 


80987Eh 
8098ACh 


After Instruction 
00 0000 0000 
00 0000 00D 


80 987F 


8 


YU 


OD 


220 


220 


Note: 


Cycle Count 


See subsection 8.5.2, Data Loads and Stores, on page 8-24 for the effects 
of operand ordering on the cycle count. 
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Appendix A 


Instruction Opcodes 


The opcode fields for all TMS320C8x instructions are shown in Table A-1. Bits 
in the table marked with a hyphen are defined in the individual instruction 
descriptions (See Chapter 13, Assembly Language Instructions). Table A-1, 
along with the instruction descriptions, fully defines the instruction words. The 
opcodes are listed in numerical order. Note that an undefined operation may 
occur if an illegal opcode is executed. (An Illegal opcode can only be generated 
by the misuse of the TMS320 floating-point software tools, by an error in the 
ROM code, or by a defective RAM.) 


A-1 


Instruction Opcodes 


Table A-1. TMS320C3x Instruction Opcodes 
30 


Instruction 
ABSF 


ABSI 
ADDC 
ADDF 
ADDI 
AND 
ANDN 
ASH 
CMPF 
CMPI 
FIX 
FLOAT 
IDLE 
IDLE2 
LDE 
LDF 
LDFI 
LDI 
LDII 
LDM 
LDP 
LSH 
LOPOWER 
MAXSPEED 
MPYF 


tT The opcode is the same for standard and delayed instructions. 
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Instruction Opcodes 


Table A-1. TMS320C3x Instruction Opcodes (Continued) 


‘Instruction 31.30.2987 6 Hs 
MPYI 0 0 0 0 1 0 1 0 1 
NEGB 0 0 0 0 1 0 1 1 0 
NEGF 0 0 0 0 1 0 1 1 1 
NEGI 0 0 0 0 1 1 0 0 0 
NOP 0 0 0 0 1 1 0 0 1 
NORM 0 0 0 0 1 1 0 1 0 
NOT 0 0 0 0 1 1 0 1 1 
POP 0 0 0 0 1 1 1 0 0 
POPF 0 0 0 0 1 1 1 0 1 
PUSH 0 0 0 0 1 1 1 1 0 
PUSHF 0 0 0 0 1 1 1 1 1 
OR 0 0 0 1 0 0 0 0 0 
RND 0 0 0 1 0 0 0 1 0 
ROL 0 0 0 1 0 0 0 1 1 
ROLC 0 0 0 1 0 0 1 0 0 
ROR 0 0 0 1 0 0 1 0 1 
RORG 0 0 0 1 0 0 1 1 0 
RPTS 0 0 0 1 0 0 1 1 1 
STF 0 0 0 1 0 1 0 0 0 
STF 0 0 0 1 0 1 0 0 1 
STI 0 0 0 1 0 1 0 1 0 
STII 0 0 0 1 0 1 0 1 1 
SIGI 0 0 0 1 0 1 1 0 0 
SUBB 0 0 0 1 0 1 1 0 1 
SUBC 0 0 0 1 0 1 1 1 0 
SUBF 0 0 0 1 0 1 1 1 1 
SUBI 0 0 0 1 1 0 0 0 0 


t The opcode is the same for standard and delayed instructions. 
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Instruction Opcodes 


Table A-1. TMS320C3x Instruction Opcodes (Continued) 
30 


Instruction 
SUBRB 


SUBRF 
SUBRI 
TSTB 
XOR 
IACK 
ADDC3 
ADDF3 
ADDI3 
AND3 
ANDN3 
ASH3 
CMPF3 
CMPI3 
LSH3 
MPYF3 
MPYI3 
OR3 
SUBB3 
SUBF3 
SUB13 
TSTB3 
XOR3 
LDFcond 
LDIcond 
BR(D)t 
CALL 


t The opcode is the same for standard and delayed instructions. 
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Instruction Opcodes 


Table A-1. TMS320C3x Instruction Opcodes (Continued) 


‘Instruction 31. 30.—s29.sB:s—i sisi HC 
RPTB 0 1 1 0 0 1 0 - - 
SWI 0 1 1 0 0 1 1 - - 
Bcond(D)t 0 1 1 0 1 0 - - - 
DBcond(D)t 0 1 1 0 1 1 - - - 
CALLcond 0 1 1 1 0 0 - - - 
TRAP cond 0 1 1 1 0 1 0 - - 
RETIcond 0 1 1 1 1 0 0 0 0 
RETScond 0 1 1 1 1 0 0 0 1 
MPYF3||ADDF3 1 0 0 0 0 0 0 0 - 

1 0 0 0 0 0 0 1 - 
1 0 0 0 0 0 1 0 - 
1 0 0 0 0 0 1 1 - 
MPYF3]||SUBF3 1 0 0 0 0 1 0 0 - 
1 0 0 0 0 1 0 1 - 
1 0 0 0 0 1 1 0 - 
1 0 0 0 0 1 1 1 - 
MPYI3||ADDI3 1 0 0 0 1 0 0 0 - 
1 0 0 0 1 0 0 1 - 
1 0 0 0 1 0 1 0 - 
1 0 0 0 1 0 1 1 - 
MPYI3||SUBI3 1 0 0 0 1 1 0 0 - 
1 0 0 0 1 1 0 1 - 
1 0 0 0 1 1 1 0 - 
1 0 0 0 1 1 1 1 - 
STF||STF 1 1 0 0 0 0 0 - - 
STI||STI 1 1 0 0 0 0 1 - - 
LDF||LDF 1 1 0 0 0 1 0 - - 
LDI||LDI 1 1 0 0 0 1 1 - - 
ABSF||STF 1 1 0 0 1 0 0 - - 


t The opcode is the same for standard and delayed instructions. 
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Table A-1. TMS320C3x Instruction Opcodes (Continued) 


Instruction 31 30 29 28 #27 +26 25 
ABSI||STI 1 1 0 0 1 0 1 
ADDF3||STF 1 1 0 0 1 1 0 
ADDI3]|STI 1 1 0 0 1 1 1 
AND3||STI 1 1 0 1 0 0 0 
ASH3||STI 1 dl 0 1 0 0 1 
FIX||STI 1 1 0 1 0 1 0 
FLOAT||STF 1 1 0 1 0 1 1 
LDF||STF 1 1 0 1 1 0 0 
LDI||STI 1 1 0 1 1 0 1 
LSH3]|STI 1 Al 0 1 1 1 0 
MPYF3||STF 1 1 0 1 1 1 1 
MPYI3||STI 1 1 1 0 0 0 0 
NEGF||STF 1 1 1 0 0 0 1 
NEGI||STI 1 1 1 0 0 1 0 
NOT||STI 1 1 1 0 0 1 1 
OR3]|STI 1 1 1 0 1 0 0 
SUBF3||STF 1 1 1 0 1 0 1 
SUBI3||STI 1 1 1 0 1 1 0 
XOR93\|STI 1 1 1 0 1 1 1 
Reserved for reset, 0 1 1 1 1 1 1 


traps, and interrupts 


t The opcode is the same for standard and delayed instructions. 
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TMS320C31/VC33 Boot Loader Source Code 


This appendix contains the source code for the ’C31/VC33 boot loader. 


aa: | 


Note: VC33 


The TMS320VC33 is logically and functionally a superset of the TMS320C31 
making nearly all literature and documentation for the TMS320C31 applica— 
ble. However, the electrical characteristics are different and are covered in 
detail in the TMS320VC33 data sheet (literature number SPRS087). 


a) 
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* C31BOOT - TMS320C31 BOOT LOADER PROGRAM 

* (C) COPYRIGHT TEXAS INSTRUMENTS INC., 1990 

* 

om NOTE: 1. AFTER DEVICE RESET, THE PROGRAM IS SET TO WAIT FOR 

* THE EXTERNAL INTERRUPTS. THE FUNCTION SELECTION OF 

7 THE EXTERNAL INTERRUPTS IS AS FOLLOWS: 

* a ae aa es a ja a a ea a i a a ee oF aes ee (ari aces 

* INTERRUPT PIN FUNCTION 

* cree eccentric ps re ee penetra fae tro) 

* 0 EPROM boot loader from 1000H 

* SS ed era ae SS eet, facet, Sapes et a ee ee ee ees ee a, Ser Seas SES Sates 

oe 1 EPROM boot loader from 400000H 

* Ss eee ei i a fa ce Mss, Fear ean ne er aes 

* 2 EPROM boot loader from FFFOO0OH 

* ee a carne esc es ae ok nee psa eset 

* 3 Serial port 0 boot loader 

* Sedans aS es ee. jes ea ee Ses aes a a Se a a, Siete ete Ber as Si et a 

* 

* 2. THE EPROM BOOT LOADER LOADS WORD, HALFWORD, OR BYTE- 

* WIDE PROGRAMS TO SPECIFIED LOCATIONS. THE 

* 8 LSBs OF FIRST MEMORY SPECIFY THE MEMORY WIDTH OF 

7 THE EPROM. IF THE HALFWORD OR BYTE-WIDE PROGRAM IS 

* SELECTED, THE LSBs ARE LOADED FIRST, FOLLOWED BY THE MSBs. 
* THE FOLLOWING WORD CONTAINS THE CONTROL WORD FOR 

- THE LOCAL MEMORY REGISTER. THE PROGRAM BLOCKS FOLLOW. 

as THE FIRST TWO WORDS OF EACH PROGRAM BLOCK CONTAIN 

* THE BLOCK SIZE AND MEMORY ADDRESS TO BE LOADED INTO. 

- WHEN THE ZERO BLOCK SIZE IS READ, THE PROGRAM BLOCK 

* LOADING IS TERMINATED. THE PC WILL BRANCH TO THE 

* STARTING ADDRESS OF THE FIRST PROGRAM BLOCK. 

* 

* 3. IF SERIAL PORT 0 IS SELECTED FOR BOOT LOADING, THE 

* PROCESSOR WILL WAIT FOR THE INTERRUPT FROM THE 

* RECEIVE SERIAL PORT 0 AND PERFORM THE DOWNLOAD. 

* AS WITH THE EPROM LOADER, PROGRAMS CAN BE LOADED 

* INTO DIFFERENT MEMORY BLOCKS. THE FIRST TWO WORDS OF EACH 
* PROGRAM BLOCK CONTAIN THE BLOCK SIZE AND MEMORY ADDRESS 
* TO BE LOADED INTO. WHEN THE ZERO BLOCK SIZE IS READ, 

* PROGRAM BLOCK LOADING IS TERMINATED. IN OTHER WORDS, 

* IN ORDER TO TERMINATE THE PROGRAM BLOCK LOADING, 

* A ZERO HAS TO BE ADDED AT THE END OF THE PROGRAM BLOCK. 
* AFTER THE BOOT LOADING IS COMPLETED, THE PC WILL BRANCH 
* TO THE STARTING ADDRESS OF THE FIRST PROGRAM BLOCK. 


* 
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reset 
into 
intl 
int2 
int3 
xinto 
rinto 


tinto 
tintl 
dint 


KEKKKKKEKKKREKRK KEKE KEKKR KEK KERR KEKE KKK KEKE KER KKK KERR KEKE KERR KKK KERR KEKE KK RRR KEKE KRKKEKEKKKKEKKEKEE 


trapo 
trapl 
trap2 
trap3 
trap4 
trap5 
trapé6 
trap7 
trap8 
trap9 
traplo. 


.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 


word 


-global check 


.sect 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
. word 
.word 
. word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
. word 
.- word 
. word 
.word 
.word 
.word 
.word 
.word 
.word 


“vectors” 
check 

809FC1 
809FC2 
809FC3 
809FC4 
809FC5 
809FC6 
809FC7 
809FC8 
809FC9 
809FCA 
809FCB 
809FCC 
809FCD 
809FCE 
809FCF 
809FDO 
809FD1 
809FD2 
809FD3 
809FD4 
809FD5 
809FD6 
809FD7 
809FD8 
809FD9 
809FDA 
809FDB 
809FDC 
809FDD 
809FDE 
809FDF 
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trapll.word 809FEBh 
trapl2 .word 809FECh 
trapl3 .word 809FEDh 
trapl4 .word 809FEEh 
trap15 .word 809FEFh 
trapl6 .word 809FFOh 
trapl7.word 809FFih 
trapl8 .word 809FF2h 
trapl9 .word 809FF3h 
trap20.word 809FF4h 
trap21.word 809FF5h 
trap22 .word 809FF6h 
trap23 .word 809FF7h 
trap24 .word 809FF8h 
trap25 .word 809FF9h 
trap26 .word 809FFAh 
trap27 .word 809FFBh 

-word 809FFCh 

-word 809FFDh 

-word 809FFEh 

-word 809FFFh 


KREKKKEKKEKKKRKKKEKKKK KEK KKRE KEK KEKE KKK KEK KERR KEKE KKK KEKE RK KKK KKK KKK KKK KKK KKK KEKKKKEKKKEKEE 


-Space 5 
check: LDI 4040h, ARO ; load peripheral mem. map 

LSH 9,ARO ; start addr. 808000h 

LDI 404Ch, SP ; initialize stack pointer to 

LSH 9,SP ; ramO addr. 809800h 

LDI 0,RO ; set start address flag off 
intloop TSTB 8,1F ; test for ext int3 

BNZ serial ; on int3 go to serial 

LDI 8,AR1 ; load 001000h / 2*9 -> ARI 

TSTB 1,1F ; test for into 

BNZ eprom_load ; branch to eprom_load if intO = 1 

LDI 2000h,AR1 ; load 400000h / 2*9 -> ARI 

TSTB 2,1F ; test for intl 

BNZ eprom_load ; branch to eprom_load if intl = 1 

LDI 7FF8h,AR1 ; load FFFOO0h / 2*9 -> ARI 

TSTB 4,1F ; test for int2 

BZ intloop ; if no intX go to intloop 
eprom_load LSH 9,AR1 ; eprom address = AR1 * 2%°9 

LDI *AR14++(1),R1 ; load eprom mem. width 

LDI sub_w,AR3 ; full-word size subroutine 

; address -> AR3 
LSH 26,R1 ; test bit 5 of mem. width word 
BN loado ; if ‘1’ start PGM loading 


; (32 bits width) 
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loado 


load2 


serial 


loop_s 
sub_s 


NOP 
LDI 


LSH 
BN 


LDI 
ADDI 


CALLU 


STI 


CALLU 


LDI 
CMPI 
BZ 
SUBI 


CALLU 


LDI 
LDI 
LDIZ 
LDI 
SUBI 


LDI 
ADDI 
BR 


-Space 1 


LDI 


LDI 
STI 
LDI 
LSH 
STI 


BR 


*AR1+4++4+ (1) 
sub_h, AR3 


ip Rel: 
loado 


sub_b, AR3 
2,AR1 


AR3 


R1,*+ARO (64h) 


AR3 


R1,RC 
0,RC 
AR2 
1,RC 


AR3 


R1,AR4 
RO,RO 
R1,AR2 
-1,R0 
1,AR3 


CALLUAR3 


TARO: 
1,AR3 
load2 


sub_s,AR3 


111h,R1 
R1,*+ARO (43h) 
0A30h,R2 
16,R2 
R2,*+ARO (40h) 


load2 


-Space 29 


RPTB 
TSTB 
BZ 
AND 


load_s 
20h,1IF 
sub_s 
OFDFh, IF 


TMS320C31 Boot Loader Source Code 


jump last half word from mem. word 
half word size subroutine 

address -> AR3 

test bit 4 of mem. width word 

if ‘1’ start PGM loading 

(16 bits width) 


byte size subroutine address -> AR3 
jump last 2 bytes from mem. word 


load new word 
according to mem. width 
set primary bus control 


load new word according to 
mem. width 

set block size for repeat loop 
if 0 block size start PGM 


block size -1 


load new word according to 

mem. width 

set destination address 

test start address loaded flag 
load start address if flag off 
set start & dest. address flag on 
sub address with loop 


load new word according to 
mem. width 

set dest. address flag off 
sub address without loop 
jump to load a new block 
when loop completed 


serial words subroutine 

address -> AR3 

R1 = 0000111h 

set CLKR,DR,FSR as serial port pins 


R2 = A300000h 
set serial port global 
ctrl. register 
jump to load 1st block 


PGM load loop 


wait for receive buffer full 
reset interrupt flag 
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load_s 
end_s 


loop _h 
sub_h 


load_h 
end_h 


loop _w 
sub_w 


load_w 
end_w 


loop_b 
sub_b 


load_b 
end_b 
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LDI *+ARO(4Ch) ,R1 

LDI RO,RO 

BNN end_s 

STI R1, *AR4++ (1) 

RETSU 

.space 22 

RPTB load_h 

LDI *AR1++(1),R1 

AND OFFFFh,R1 

LDI *AR1++(1),R2 

LSH 16,R2 

OR R2,R1 

LDI RO,RO 

BNN end_h 

STI R1, *AR4++ (1) 

RETSU 

.Space 26 

RPTB load_w 

LDI *AR1++(1),R1 

LDI RO,RO 

BNN end_w 

STI R1, *AR4++ (1) 

RETSU 

.Space 14 

RPTB load_b 

LDI *AR1++(1),R1 

AND OFFh,R1 

LDI *AR1++(1),R2 

AND OFFh,R2 

LSH 8,R2 

OR R2,R1 

LDI *AR1++(1),R2 

AND OFFh,R2 

LSH 16,R2 

OR R2;.Ri 

LDI *AR1++(1),R2 
LSH 24,R2 

OR R2,R1 

LDI RO,RO 

BNN end_b 

STI R1, *AR4++ (1) 

RETSU 

-Space 1 

.end 


test load address flag 


store new word to dest. 
return from subroutine 


PGM load loop 
load LSB half word 


load MSB half word 


Rl = a new 32-bit word 
test load address flag 


store new word to dest. 
return from subroutine 


PGM load loop 
read a new 32-bit word 
test load address flag 


store new word to dest. 
return from subroutine 


PGM load loop 


load 1st byte ( LSB ) 


load 2nd byte 


load 3rd byte 
load 4th byte ( MSB ) 


Rl = a new 32-bit word 
test load address flag 


store new word to dest. 
return from subroutine 


address 


address 


address 


address 
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TMS320C32 Boot Loader Source Code 


This appendix includes a description of the ‘C32 boot loader sequence of 


events and a listing of its source code. 
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Boot-Loader Source Code Description 


C.1 Boot-Loader Source Code Description 
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Figure C—1 shows the boot loader program flow chart. The boot loader pro- 
gram starts by initializing three registers: AR7, SP, and /RO. These registers 
hold the peripheral bus memory map register base address, the timer counter 
register (used as a stack), and a flag that indicates the first block, respectively. 
Then, the program checks for serial port boot load or memory boot load mode 
by processing the bit fields set in the interrupt flag register (/F). For a serial port 
boot load, the program initializes the serial port for 32-bit fixed-burst-mode 
reads with an externally generated serial port clock and FSR. 


For a memory boot load, AR3 is set to the boot source address, AR2 points 
to the boot source strobe control register, and R2 contains the value that is 
stored in this strobe control register. The boot loader also sets the bit field 1/ 
OXFO of the I/O flag register (IOF) if the handshake mode was selected. Then 
the boot loader reads the first word of the boot source program. This 32-bit 
word indicates the boot memory width and the boot load program stores this 
value in R5. ARO points to the read_mc routine that performs this read. 


After reading the memory width word, the boot loader reads IOSTRB, STRBO, 
and STRB1 control register values of the source program. These values are 
temporarily saved in the DMA source address register, DMA destination ad- 
dress register, and DMA transfer counter registers, respectively. Then, the 
program reads the block size with the read_mc routine. If the block size is 0, 
the boot loader restores the values of IOSTRB, STRBO, and STRB1 previously 
saved and branches to the destination address of the first block loaded and 
begins program execution. If the block size is not 0, the boot loader stores the 
block size in the BK register. This is used as a counter in a repeat block (RPTB) 
to transfer all the data or program in that block. 


For each block, the boot loader reads the destination address and the destina- 
tion strobe control word. The program stores the destination address in the 
AR5d register. The destination strobe control word includes the destination 
strobe identification, the contents of the destination strobe control register 
(includes memory width and data size). The boot loader extracts this informa- 
tion from the destination control word and stores the destination strobe-control 
register memory-mapped address in the AR4 register, the contents of the des- 
tination strobe control register in the R4 register, and the source data size in 
the R3 register. The boot loader sets the AR7 register to the appropriate read 
routine read_sO for serial port boot load and read_mb for memory boot load. 
The read routine uses these registers to control the transfer of a block of data 
or program. 


Figure C-1. Boot-Loader Flow Chart 


Interrupt flag IF 


Memory width: R5 
; Memory control word ; 
read routine: ARO 


Serial initialize 
serial global 
control register 


Boot-Loader Source Code Description 


Initialize registers: 
AR7, SP, IRO 


Boot source 
address: AR3 
Boot strobe 
pointer: AR2 

Boot strobe value: 
R2 
Handshake mode: 
IOF 


Process intertupts 
INTO, INT1, INT2 


Process memory 
width word 


Read and store 
strobe values 


Yes 


Restore strobe 


values previously 
saved 


Start 
program execution 


Pid 
Read block size Block size RC 


No 
Read destination Destination 
address address: AR3 
Destination strobe 
pointer: AR4 
Destination strobe 
value: R4 
Destination data 
size: R3 
Memory block read 
routine: AR1 
Memory width: 
R5 


Read destination 
strobe 


Select read 
routine 
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C.2 Boot-Loader Source Code Listing 
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* C32BOOT - TMS320C32 BOOT LOADER PROGRAM (143 words) March-96 


+ FF F F F 


5 i i eo 


te FF FH FF HF HF FF HF F FF FF HF HF HF FF HF HF HF F 


° 
KR 


(C) COPYRIGHT TEXAS INSTRUMENTS INCORPORATED, 1994 Vi27 


Following device reset, the program waits for an external interrupt. 
The interrupt type determines the initial address from which the boot 
loader will start loading the boot table to the destination memory: 


1000h (STRBO) 
810000h (IOSTRB) 


900000h (STRB1) 


80804Ch (sportO Rx) 


1000h (STRBO) ASYNC 
810000h (IOSTRB) ASYNC 
900000h (STRB1) ASYNC 


If INT3 is asserted together with (INT2 or INT1 or INTO) following reset, 
that indicates that the boot table is to be read asynchronously from EPROM 
using pins XFO and XF1 for handshaking. The handshaking protocol assumes 
that the data ready signal generated by the host arrives through pin XF1. 
The data acknowledge signal is output from the C32 on pin XFO.Both signals 
are active low. The C32 will continuously toggle the IACK signal while 
waiting for the host to assert data ready signal (pin XF1). 


The boot operation involves transfer of one or more source blocks from the 
boot media to the destination memory. The block structure of the boot table 
serves the purpose of distributing the source data/program among different 
memory spaces. Each block is preceded by several 32-bit control words de 
scribing the block contents to the boot loader program. 


When loading from serial port, the boot loader reads the source data/program 
and writes it to the destination memory. There is only one way to read the 
serial port. When loading from EPROM, however, there are 4 ways to read and 
assemble the source contents, depending on the width of boot memory and the 
size of the program/data being transferred. Because there is a possibility 
that reads and writes can span the same STRB space the boot loader loads the 
appropriate STRB control registers before each read and write. 


If the boot source is EPROM whose physical width is less than 32 bits, the 
physical interface of the EPROM device(s) to the processor should be the 
same as that of the 32-bit interface. (This involves a specific connection 
to C32’s strobe and address signals). The reason for such arrangement is 


5s 
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that to function properly, the boot loader program always expects 32-bit 
data from 32-bit wide memory during the boot load operation. Valid boot 
EPROM widths are : 1, 2, 4, 8, 16 and 32 bits. 


A single source block cannot cross STRB boundaries. For example, its 
destination cannot overlap STRBO space and IOSTRB space. Additionally, all 
of the destination addresses of a single source block should reside in 
physical memory of the same width. It is also not permitted to mix prg and 
data in the same source block. 


The boot loader stops boot operation when it finds 0 in the block size 
control word. Therefore, each boot table should always end with a 0, 
prompting the boot loader to branch to the first address of the first block 
and start program execution from that location. 


boot loader program register assignments, and altered mem locations 


Kaen eee eee = ee ee en 


+ oF FF FO OF OH OH OH OH OH OH FF FF FH FH HF HF HF HF HF FF OH HH HK HK HK HF HF HK 


- peripheral memory map IOF - XFO (handshake 0O) 
- read cntrl data subr pointer IOF - XF1 (handshake I) 
- read block data/prg subr pointer 


- read STRB value R4 - write STRB value 
- read STRB pointer AR4 - write STRB pointer 
- read data/prg pointer ARS - write data/prg pointer 


read --> R1 --> write 


- EXEC start flag stack - 808024h - TIMO cnt reg 
- EXEC start address 808028h - TIMO per reg 

IOSTRB - 808004h - DMAO dst reg 
- data SIZE STRBO - 808006h - DMAO dst reg 
—- mem WIDTH STRB1 - 808008h - DMAO cnt reg 


- memory read value AR6,R7,RO,BK - scratch registers 


.word start ; reset vector 
. space 44h ; program starts @45h 


* Init registers 808000h --> AR7, 808023h --> SP, -1 --> IRO 
8 a SS a SS SS EE EE F 
start LDI 4040h,AR7 ; load peripheral memory map 

LSH 9,AR7 ; base address = 808000h 

LDI 23h,SP ; initialize stack pointer to 

OR AR7,SP ; 808023h (timer counter - 1) 

LDI -1,IRO ; reset exec start addr flag 
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Test for INT3 and, 
load AR3 with 1000h if INTO, 


if set exclusively, 


810000h if INT1 900000h if INT2. 


Also load 


proceed with serial boot load. Else, 


, 


--> AR2 and force the boot strobe value to 
(INTO or INT1 or INT2) and INT3, turn on the 


appropriate boot strobe pointer 
reflect 32bit memory width. If 
handshake mode. 


t+ Ft FF 


waitl LDI IF,RO 
AND OFh, RO ; clean 
CMPI 8,RO ; test for INT3 
BEQ serial pee: Serial boot load mode 
LDI AR7,AR2 
ADDI 60h, AR2 ; 808060h (IOSTRB) --> AR2 
TSTB 2,R0 ; test for INT1 
LDINZ 4080h, AR3 ; 810000h / 2**9 
BNZ exit3 pRRRKKKE 
ADDI 4,AR2 ; 808064h (STRBO) --> AR2 
TSTB 1,RO ; test for INTO 
LDINZ 8,AR3 ; 001000h / 2**9 
BNZ exit3 pRRRRRKK 
ADDI 4,AR2 ; 808068h (STRB1) --> AR2 
TSTB 4,R0 ; test for INT2 
LDINZ 4800h, AR3 ; 900000h / 2**9 
BZ waitl p RR RR KEK 
exit3 TSTB 8,RO ;*; test#1l - INT3 asserted 
BZ exit2 ;*; test#2 - INXF1 low (not used) 
TSTB 80h, IOF ;*; enable handshake mode if 
LDI 6,10F ;*; test#l passed 
exit2 LDI OFh, R2 
LSH 16,R2 ; force boot data size to 32 
OR *AR2,R2 ; force boot mem width to 32 
STI R2,*AR2 
LSH 9,AR3 ; boot mem start addr --> AR3 
* xx000001 -— 1 bit 
#oSoo SS SSSS SSS SSS SSS SSS SSS SS SS SSS Se SS SSeS SSS SeS= ¥KO00010 = 2 bit 
* Process MEMORY WIDTH control word (32 bits long) xx000100 - 4 bit 
*¥SsoSsaSSSe SSS SSeS SSeS SSS Ssae ees eee SSeasSses== ¥KX001000 —- 8 bit 
* xx010000 - 16 bit 
* xx100000 - 32 bit 
LDI read_mc, ARO ; use memory to read cntrl words 
F read_mc --> ARO 
LDI 1,R5 ; mem width = 1 (init) 
LDI 32,AR6 ; mem reads = 32 (init) 
CALLU read _m ; read memory once (lst read) 
loop2 TSTB 1,R6 
BNZ label4 
LSH -1,R6 ; look at next bit 
LSH -1,AR6 ; decr mem reads 
LSH 1,R5 ; incr mem width --> RS5 
BU loop2 p RR RR RRR 
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label4 SUBI 2,AR6 
CMPI 0,AR6 ; set flags 
BN strobes ;*******; total # of mem reads = 32/R5 
label5 CALLU read_m ; read memory once 
DBU AR6,label5 pReee 
#SSsoo SS Se SSeS Sa See SSS SS Sea SS See SSS SSS SSeS SSS eS SS Se Se SSeS SSS SSS SSeS SSeS Sst 
* Read and save IOSTRB, STRBO & STRB1 (to be loaded at end of boot load) 
Fess a SS Sa SaS sees aS SSS SSS SSS aS aS SSS SSS ee SSS See Sasa SSS ease eS SSeSsSeSSse=5=55=* 
strobes CALLU ARO 
STI R1, *+AR7 (4) ; IOSTRB --> (DMA src) 
CALLU ARO 
STI R1,*+AR7 (6) ; STRBO --> (DMA dst) 
CALLU ARO 
STI R1,*+AR7 (8) 7 STRB1 --> (DMA cnt) 
#SSS oS SSS a SSeS SSS Se SS Sa SSS SS SS Ses SSS SSeS Se SSeS SSS SSS SS SSS Sea SSE SS 
* Process block size (# of bytes, half-words, or words after STRB cntrl) 
FSSS SSeS aS Sa SS SSeS Sasa SSS SSeS SS SSS SSS SS Se SSS SSS Sa a Se = SSS SSS SS SSeS SsSo5=555% 
block CALLU ARO ; read boot memory cntrl word 
LDI R1,R1 ; is this the last block ? 
BNZ label2 pReeeeEEE? nO, go around 
LDI *+AR7 (4) ,RO 7 DMA src) 
STL RO, *+AR7 (60h) 7 restore IOSTRB 
LDI *+AR7(6),RO j DMA dst) 
STI RO, *+AR7 (64h) 7 restore STRBO 
LDI *+AR7(8),RO 7 DMA cnt) 
STI RO, *+AR7 (68h) 7 restore STRB1 
BU IR1 j*¥***keee, Dranch to start of program 
label2 LDI R1,RC ; setup transfer loop 
SUBI 1, RC ; RC - 1 --> RC 
SSSseSssSeSsSSsSoesSeSS5 SoS SSS SSeS sae Sessa eS SSeS See Se Sse eas see Sess aeSsseoseeee* 
* Process block destination address, save start address of first block 
Fos Se Se et ee ee ek 
CALLU ARO ; read boot memory cntrl word 
LDI R1,AR5 ; set dest addr --> AR5 
CMPI 0, IRO ; look at EXEC start addr flag 
LDINZ AR5,IR1 ; if -1, EXEC start addr --> IR1 
LDINZ 0,IRO ; set EXEC start addr flag 
ee ee 
* (For internal destination, this word should be 0 or 60h. The first case will 
* result in 0 --> DMA cntrl reg, in second case 0 --> IOSTRB reg) . 
* Process block destination strobe control (sss...sss 0110 xx00) 
* eee 555 SSS SSS SSS SS SSS SS SS SS SS SS S======== Strb value ==== 00 - IOSTRB 
* 01 - STRBO 
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CALLU ARO ; 10 - STRB1 
LDI R1,R4 

AND 6Ch,R1 ; dest mem strb pntr --> AR4 

OR3 AR7,R1,AR4 

LSH -8,R4 ; dest memory strobe --> R4 

LDI R4,R3 

LSH -16,R3 

AND 3,R3 ; dest data size -—-> R3 

TSTB O0Ch,R1 ; (IOSTRB case) 

LDIZ 3,R3 


CMPI 0,R5 
LDIEQ read_s0,AR1 ; read serial porto 
LDINE read_mb,AR1 ; read memory 


*# se sesss sess Se SSeS SSeS Se SSS SS SSS SS SSS SS SS SSS SS SS SSS SS SSS SSS SS SS SS SS SSS SSS SSSSses 
RPTB loop4 
CALLU AR1 ; read data/prg 
STI R4, *AR4 ; set write strobe 
NOP ; pipeline 
loop4 STI R1,*AR5S++ ; write data/prg!!!!!!! lt! 
BU block p***k*kkee: process next block 
IE SS SS SS SS SS SS SS a SS SS SS SS SS SS 8 Se SS SS SS SS 


* Load R5 with 0, load read_sO to ARO and initialize serial port_0 


serial LDI read_s0,ARO ; use serial to read cntrl words 
LDI 0,R5 ; memory WIDTH = serial 
LDI 0O,R ; dummy 
LDI AR7,AR2 ; dummy 
LDI 111h,RO ; 0000111h --> RO 
STI RO, *+AR7 (43h) ; set CLKR,DR,FSR as serial 
LDI 0A30h,R7 7 port pins 
LSH 16,R7 ; A300000h --> R7 
STI R7, *+AR7 (40h) ; set serial global cntrl reg 
BU strobes px*eeeee > process first block 

, ee ee ee ey 


Read a single value from serial or boot memory. The number of 

memory reads depends on mem WIDTH and data SIZE. R1 returns the 

read value. (Serial sim: NOP --> BZ read_sO & LDI @4000H,R1 --> LDI 
*+AR7 (4Ch) ,R1) 


+ + F 


Boot-Loader Source Code Listing 


read_sO TSTB 20h, IF ; look at RINTO flag 

BZ read_s0 ; wait for receive buffer full 

AND OFDFh,IF ; reset interrupt flag 

LDI *+AR7(4Ch) ,R1 ; read data --> R1 

RETSU 
I ca ca a A a a a a ya ea a ea a a a ea ae as a ty 
read_mc LDI 3,R3 ; data size = 32, 3 --> R3 
read_mb LDI 1,BK ; 00000001 (ex: mem width=8) 

LSH R5,BK ; 00000100 

SUBI 1,BK ; OOOOOOFF = mask --> BK 

LDI R3,AR6 ; 0 - 1 000 EXPAND 

ADDI 1,AR6 . 1 - 10 000 DATA --> AR6 

LSH 3,AR6 ; 11 - 100 000 SIZE 

LDI R5,R0 
loop3 CMPI 1,RO0 

BEQ exitl ; DATA SIZE 

LSH -1,R0 poses s<<-- - 1 --> AR6 

LSH -1,AR6 ; MEM WIDTH 

BU loop3 p RR RRR ER 
exitl SUBI 1,AR6 

LDI 0,RO ; init shift value 

LDI 0,R1 ; init accumulator 
loop1 ADDI 3,SP ; 808027h --> SP 

CALLU read_m ; read memory once --> R6 

SUBI 3,SP ; 808024h --> SP 

AND3 R6,BK,R7 ; apply mask 

LSH RO,R7 ; shift 

OR R7,R1 ; accumulate --> R1 

ADDI R5,RO0 ; increment shift value 

DBU AR6,loop1 jp**eee: decrement #of chunks --> AR6 

RETSU 
Nese aS ee eS SS ee ee ee ee ee Ee 
* Perform a single memory read from the source boot table. Handshake enabled if 
* IOXFO bit of IOF reg is set, disabled when reset. IACK will pulse continuously 
* if handshake enabled and data not ready (to achieve zero-glue interface when 
* connecting to a C40 comm-port) 
Te SS SS SS SS SS a SS SS SS SS ee SF 
read _m TSTB 2,10F ; handshake mode enabled ? 

STI R2, *AR2 ; set read strobe !!!!!!liiritt 

BNZ loop5 ; yes, jump over 

LDI *AR3++,R6 7 no, just read memory & return 

RETSU 
*------------------------- ------------------------ -------- (C40) 
loop5 IACK * ART ;*; intrnl dummy read pulses IACK 

TSTB 80h, IOF ;*; wait for data ready 

BNZ loops ;*; (XF1 low from host) 

LDI *AR3++,R6 ;*; read memory once --> R6 
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LDI 2,10F ;*; assert data acknowledge 
;*; (XFO low to host) 


loopé TSTB 80h, IOF ;*; wait for data not ready 
BZ loop6é ;*; (XF1 high from host) 
LDI 6,10F ;*; deassert data acknowledge 


;*; (XFO high to host) 
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Glossary 


AO-A23: External address pins for data/program memory or I/O devices. 
These pins are on the primary bus. 


address: The location of program code or data stored in memory. 


addressing mode: The method by which an instruction interprets its oper- 
ands to acquire the data it needs. 


ALU: Arithmetic logic unit. The part of the CPU that performs arithmetic and 
logic operations. 


analog-to-digital (A/D) converter: A converter with internal sample-and- 
hold circuitry used to translate an analog signal to a digital signal. 


ARAU: Auxiliary-register arithmetic unit. A 32-bit ALU used to calculate indi- 
rect addresses using the auxiliary registers as inputs and outputs. 


arithmetic logic unit (ALU): The part of the CPU that performs arithmetic 
and logic operations. 


auxiliary registers (ARn): A set of registers used primarily in address gen- 
eration. 


auxiliary-register arithmetic unit (ARAU): Auzxiliary-register arithmetic 
unit. A 32-bit ALU used to calculate indirect addresses using the auxiliary 
registers as inputs and outputs. 


bit-reversed addressing: Accessing data from memory, registers, and the 
instruction word by reversing several bits of an address in order to speed 
processing of algorithms, such as Fourier transforms. 
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BK: Block-size register. A 32-bit register used by the ARAU in circular ad- 
dressing to specify the data block size. 


boot loader: An on-chip code that loads and executes programs received 
from a host processor through standard memory devices (including 
EPROM), with and without handshake, or through the serial port to RAM 
at power up. 


carry bit: A bit in the status register (ST) used by the ALU for extended arith- 
metic operations and accumulator shifts and rotates. The carry bit can 
be tested by conditional instructions. 


circular addressing: Accessing data from memory, registers, and the 
instruction word by using an auxiliary register to cycle through a range 
of addresses to create a circular buffer in memory. 


context save/restore: A save/restore of system status (status registers, ac- 
cumulator, product register, temporary register, hardware stack, and 
auxiliary registers, etc.) when the device enters/exits a subroutine such 
as an interrupt service routine. 


CPU: Central processing unit. The unit that coordinates the functions of a 
processor. 


CPU cycle: The time it takes the CPU to go through one logic phase (during 
which internal values are changed) and one latch phase (during which 
the values are held constant). 


ICPU interrupt flag register (IF): A register that contains CPU, serial ports, 
timer, and DMA interrupt flags. 


cycle: See CPU cycle. 


DO-D31: External data-bus pins that transfer data between the processor 
and external data/program memory or I/O devices. See also LDO-LD31. 


data-address generation logic: Circuitry that generates the addresses for 
data-memory reads and writes. This circuitry can generate one address 
per machine cycle. See also program address generation logic. 


data-page pointer: A 32-bit register used as the 8 most significant bits 
(MSBs) in addresses generated using direct addressing. 


Glossary 


data size: The number of bits (8, 16, or 32) used to represent a particular 
number. 


decode phase: The phase of the pipeline in which the instruction is decoded 
(identified). 


DMA coprocessor: A peripheral that transfers the contents of memory loca- 
tions independently of the processor (except for initialization). 


DMA coniroller: See DMA coprocessor. 
DP: See data-page pointer. 


dual-access RAM: Memory that can be accessed twice in a single clock 
cycle. For example, code that can read from and write to a RAM in one 
clock cycle. 


external interrupt: A hardware interrupt triggered by a pin. 


extended-precision floating-point format: A 40-bit representation of a 
floating-point number with a 32-bit mantissa and an 8-bit exponent. 


extended-precision register: A 40-bit register used primarily for extended- 
precision floating-point calculations. Floating-point operations use bits 
39-0 of an extended-precision register. Integer operations, however, use 
only bits 31-0. 


FIFO buffer: First-in, first-out buffer. A portion of memory in which data is 
stored and then retrieved in the same order in which it was stored. Thus, 
the first word stored in this buffer is retrieved first. 


hardware interrupt: An interrupt triggered through physical connections 
with on-chip peripherals or external devices. 


hit: A condition in which, when the processor fetches an instruction, the 
instruction is available in the cache. 
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IACK: = /nterrupt acknowledge signal. An output signal indicating that an in- 
terrupt has been received and that the program counter is fetching the 
interrupt vector that will force the processor into an interrupt service rou- 
tine. 


IE: See internal interrupt enable register. 


I/O flag (IOF) register: Controls the function (general-purpose I/O or inter- 
rupt) of the external pins. It also contains timer/DMA interrupt flags. 


index registers: Two 32-bit registers (IRO and IR1) that are used by the 
ARAU for indexing an address. 


internal interrupt: A hardware interrupt caused by an on-chip peripheral. 


internal interrupt enable register: A register (in the CPU register file) that 
determines whether the CPU or DMA responds to interrupts from exter- 
nal interrupt pins, the serial ports, the timers, and the DMA coprocessor. 


interrupt: A signal sent to the CPU that (when not masked) forces the CPU 
into an ISR. This signal can be triggered by an external device, an on- 
chip peripheral, or an instruction (TRAP, for example). 


interrupt acknowledge (IACK): A signal indicating that an interrupt has 
been received and that the program counter is fetching the interrupt vec- 
tor location. 


interrupt-trap table pointer (ITTP): A bit field in the status register that indi- 
cates the starting location (base address) of the interrupt-trap vector 
table. The base address is formed by left-shifting the value of the ITTP 
bit field by 8 bits. 


ISR: /nterrupt service routine. A module of code that is executed in 
response to a hardware or software interrupt. 


ITTP: See interrupt-trap table pointer. 


LSB: Least significant bit. The lowest-order bit in a word. 
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machine cycle: See CPU cycle. 


mantissa: A component of a floating-point number consisting of a fraction 
and a sign bit. The mantissa represents a normalized fraction whose 
binary point is shifted by the exponent. 


maskable interrupt: A hardware interrupt that can be enabled or disabled 
through software. 


memory-mapped register: One of the on-chip registers that point to ad- 
dresses in memory. Some memory-mapped registers point to data 
memory, and somepoint to input/output memory. 


memory width: The number of bits that can be stored in a single external 
memory address. 


MFLOPS: Millions of floating point operations per second. A measure of 
floating-point processor speed that counts of the number of floating-point 
operations made per second. Also called megaflops. 


microcomputer mode: A mode in which the on-chip ROM (boot loader) is 
enabled. This mode is selected via the MP/MCBL pin. 


microprocessor mode: A mode in which the on-chip ROM is disabled. This 
mode is selected via the MP/MCBL pin. See also MP/MC pin. 


MIPS: Million instructions per second. 


miss: A condition in which, when the processor fetches an instruction, it is 
not available in the cache. 


MSB: Most significant bit. The highest-order bit in a word. 


multiplier: A device that generates the product of two numbers. 


NMI: Nonmaskable interrupt. A hardware interrupt that uses the same logic 
as the maskable interrupts but cannot be masked. 
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overflow flag (OV) bit: A status bit that indicates whether or not an arithme- 
tic operation has exceeded the capacity of the corresponding register. 


PC: Program counter. A register that contains the address of the next 
instruction to be fetched. 


peripheral bus: A bus that is used by the CPU to communicate to the DMA 
coprocessor, communication ports, and timers. 


pipeline: A method of executing instructions in an assembly-line fashion. 


RC: See repeat counter register. 


read/write (R/W) pin: A memory-control signal that indicates the direction 
of transfer when communicating to an external device. 


register file: A bank of registers. 


repeat-counter (RC) register: A 32-bit register in the CPU register file that 
specifies the number of times to repeat a block of code when performing 
a block repeat. 


repeat mode: A zero-overhead method for repeating the execution of a 
block of code. Using repeat modes allows time-critical sections of code to 
be executed in the shortest possible time. 


reset: A means to bring the CPU to a known state by setting the registers 
and control bits to predetermined values and signaling execution to fetch 
the reset vector. 


reset pin: A signal that causes the device to reset. 


R/W: See read/write pin. 
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short floating-point format: A 16-bit representation of a floating point num- 
ber with a 12-bit mantissa and a 4-bit exponent. 


short floating-point format for external 16-bit data: A 16-bit representa- 
tion of a floating point number with an 8-bit mantissa and an 8-bit expo- 
nent. 


short integer format: A 2s-complement,16-bit format for integer data. 
short unsigned-integer format: A 16-bit unsigned format for integer data. 


sign-extend: The process of filling the high-order bits of a number with the 
sign bit. 


single-precision floating-point format: A 32-bit representation of a float- 
ing-point number with a 24-bit mantissa and an 8-bit exponent. 


single-precision integer format: A 2s-complement 32-bit format for inte- 
ger data. 


single-precision unsigned-integer format: A 32-bit unsigned format for 
integer data. 


software interrupt: An interrupt caused by the execution of a TRAP instruc- 
tion. 


ST: See status register. 


stack: A block of memory reserved for storing and retrieving data on a first-in 
last-out basis. It is usually used for storing return addresses and for pre- 
serving register values. 


status register: A register in the CPU register file that contains global in- 
formation relating to the current state of the CPU. 


timer: A programmable peripheral that generates pulses for timed events. 


timer-period register: A 32-bit memory-mapped register that specifies the 
period for the on-chip timer. 
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wait state: A period of time that the CPU must wait for external program, 
data, or I/O memory to respond when it reads from or writes to that exter- 
nal memory. The CPU waits one extra cycle for every wait state. 


wait-state generator: A program that can be modified to generate a limited 
number of wait states for a given off-chip memory space (lower program, 
upper program, data, or I/O). 


XAO-XA13: External address pins for data/program memory or I/O devices. 
These pins are on the expansion bus of the C30. See also AO-A23. 


XDO-XD31: External data bus pins that transfer data between the processor 
and external data/program memory or I/O devices of the C30. See also 
DO-D31. 


zero fill: The process of filling the low- or high-order bits with Os when load- 
ing a number into a larger field. 


16-bit-wide configured memory, 
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2-operand instruction 13-3 
2-operand instruction word 8-25 
3-operand addressing modes 2-17, 13-24—13-25 


3-operand instruction 13-4 
add, integer 13-58 
arithmetic shift 13-73 
bitwise-exclusive OR 13-250 
bitwise-logical 
ANDN_ 13-69 
OR 13-190 
compare 
floating-point value 13-90 
integer 13-93 
logical shift 13-138 
multiply 
floating-point value 13-147 
integer 13-161 
subtract 
floating-point value 13-230 
integer 13-235 
with borrow 13-224 
test bit fields 13-247 


3-operand instruction word 8-25 


32-bit-wide configured memory, 
TMS320C31/VC33 11-10 


A/D converter, definition D-1 
AO-A30, definition D-1 
absolute value of 


floating-point instruction (ABSF) 13-41 
integer instruction (ABSI) 13-44 
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add 
floating-point value instruction (ADDF) 13-51 
3-operand instruction 13-53 
integer 
(ADDI) 13-57 
3-operand instruction 13-58 
with carry instruction (ADDC) 13-48 
3-operand instruction 13-49 
address, definition D-1 
address pins, external D-8 
addressing 6-1-6-32 
modes 
3-operand 2-17, 13-24—13-25 
branch 2-17 
definition D-1 
general 2-17, 13-20-13-23 
groups 13-20-13-27 
parallel 2-17, 13-25-13-26 
register 6-3-6-29 
types 6-2 
bit-reversed 6-26-6-27 
circular 6-21-6-25 
direct 6-4-6-29 
immediate 6-18-6-29 
indirect 6-5—-6-29 
PC-relative 6-19-6-20 
ALU. See arithmetic logic unit 
applications, general listing 1-7 
AR 
read of 8-8 
write to 8-7 
architectural block diagram 
TMS320C30 2-3 
TMS320C31/VC33 2-4 
TMS320C32 2-5 
architecture 2-2 
arithmetic, shift instruction (ASH) 13-71 
arithmetic logic unit (ALU) 2-8 
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Index 


arithmetic logic unit (ALU), definition D-1 
assembler syntax expression, example 13-38 


assembly language, instruction set 
2-operand instructions 13-3 
3-operand instructions 13-4 
interlocked operations instructions 13-5—-13-6 
load and store instructions 13-2 
low-power control instructions 13-5 
program control instructions 13-4—13-5 


assembly language instructions 13-1—13-37 
3-operand instruction 
add 
floating-point value 13-53-13-54 
integer with carry 13-49-13-50 
arithmetic shift 13-73-13-75 
bitwise-exclusive OR 13-250-13-251 
bitwise-logical 
AND 13-63-13-64 
AND with complement 13-69-—13-70 
OR 13-190—13-191 
compare 
floating-point value 13-90-13-91 
integer 13-93-13-94 
logical shift 13-138—13-140 
multiply 
floating-point value 13-147—-13-148 
integer 13-161-13-162 
subtract 
floating-point value 13-230-13-231 
integer 13-235-13-236 
with borrow 13-224—13-225 
test bit fields 13-247-13-248 
absolute value of 
floating-point (ABSF) 13-41 
integer (ABSI) 13-44—13-45 
add 
floating-point value (ADDF) 13-51-13-52 
integer (ADDI) 13-57 
3-operand instruction 13-58—13-59 
integer with carry (ADDC) 13-48 
arithmetic shift (ASH) 13-71-13-72 
bitwise-exclusive OR (XOR) 13-249 
bitwise-logical 
AND 13-62 
with complement (ANDN) 13-67—13-68 
complement (NOT) 13-184-13-185 
OR 13-188-13-189 
branch 
conditionally 
delayed (BcondD) 13-81-13-82 
standard (Bcond) 13-79-13-80 
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assembly language instructions (continued) 
unconditionally 
delayed (BRD) 13-84 
standard (BR) 13-83 
call, subroutine 
(CALL) 13-85 
conditionally (CALLcond) 13-86—13-87 
categories 
2-operand 13-3 
3-operand 13-4 
interlocked operation 13-5—13-6 
load and store 13-2 
low-power control 13-5 
program control 13-4—13-5 
compare 
floating-point value (CMPF) 13-88—13-89 
integer (CMPI) 13-92 
decrement and branch, conditionally 
delayed instruction (DBcondD) 13-97-13-98 
standaard instruction (DBcond) 13-95—13-96 
divide clock by 16 (LOPOWER) 13-135 
example instruction 13-38-13-40 
floating-point to integer conversion 
(FIX) 13-99-13-100 
idle until interrupt (IDLE) 13-109 
integer to floating-point conversion 
(FLOAT) 13-103-13-104 
interrupt, acknowledge (IACK) 13-107—13-108 
load 
data-page pointer (LDP) 13-134 
floating-point 
exponent (LDE) 13-112-13-113 
mantissa (LDM) 13-133 
value 
(LDF) 13-114 
conditionally (LDFcond) 13-115-13-116 
interlocked (LDFI) 13-117—13-118 
integer 
(LDI) 13-123-13-124 
conditionally (LDIcond) 13-125—-13-126 
interlocked (LDII) 13-127-—13-128 
logical shift (LSH) 13-136-13-137 
low-power idle (IDLE2) 13-110-—13-111 
multiply 
floating-point value (MPYF) 13-146 
integer (MPYI) 13-159-13-160 
negate 
floating-point value (NEGF) 13-174-13-175 
integer (NEGI) 13-178 
negative integer with borrow (NEGB) 13-173 
no operation (NOP) 13-181 
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assembly language instructions (continued) assembly language instructions (continued) 
normalize (NORM) 13-182-—13-183 rotate 
parallel instructions left (ROL) 13-204 
ABSF and STF 13-42 assembly language instructions (continued) 
ABSI and STI 13-46 left through carry (ROLC) 13-205—13-206 
ADDF3 and STF 13-55 right (ROR) 13-207 
ADDI3 and STI 13-60—13-61 right through carry (RORC) 13-208 
AND3 and STI 13-65—-13-66 round floating-point value 
ASH3 and STI 13-76-13-78 (RND) 13-202-13-203 
FIX and STI 13-101-13-102 signal, interlocked (SIGI) 13-213 
FLOAT and STF 13-105-13-106 software interrupt (SWI) 13-242 
LDF and LDF 13-119-13-120 Ue 
LDF and STF 13-121-13-122 ee ee ona 
LDiand LD! 13-129-13-130 interlocked (STFI) 13-215-13-216 
LDI and STI 13-131-13-132 integer 
LSH3 and STI 13-141-13-144 (STI) 13-219 
MPYF8 and ADDF3 13-149-13-152 interlocked (STII) 13-220 
MPYF3 and STF 13-153-13-154 subtract 
MPYF83 and SUBF3 13-155-13-158 floating-point value (SUBF) 13-228-—13-229 
MPYI3 and ADDI3 13-163-13-166 integer 
MPYI3 and STI 13-167-13-168 (UBT) tess 


conditionally (GSUBC) 13-226—13-227 


MPYI3 and SUBI3 13-169-13-172 with borrow (SUBB) 13-223 


NEGF and STF 13-176 


reverse 
NEGI and STI 13-179-13-180 floating-point value (SUBRF) 13-240 
NOT and STI 13-186-13-187 integer 

OR3 and STI 13-192-13-193 (SUBRI) 13-241 

STF and STF 13-217-13-218 with borrow (SUBRB) 13-239 
STland STI 13-221-13-222 symbols used to define instruc- 

SUBF3 and STF 13-232-13-233 MSHS Use lo7e! 


test bit fields (TSTB) 13-245-13-246 


Bl Tl 13-237-13-2 
ee ee eee ae trap conditionally (TRAPcond) 13-243-13-244 


XOR3 and STI 13-252-13-254 


POP auxiliary (AR7—ARO) registers 3-4 
floating-point value (POPF) 13-195 auxiliary register 
integer instruction 13-194 ALUs 2-8 
PUSH arithmetic unit (ARAU) 6-5 
floating-point value (PUSHF) 13-197 definition D-1 
integer 13-196 definition D-1 
register syntax 13-36 
repeat 
block (RPTB) 13-209-13-210 E} 
single instruction (RPTS) 13-211-13-212 
restore clock to regular speed bank switching 
(MAXSPEED) 13-145 example 9-14, 10-18 
return programmable 9-12-9-14 
from interrupt conditionally (RETI- bit-reversed, addressing 6-26—6-27 
cond) 13-198-13-199 definition D-1 
from subroutine conditionally (RETS- FFT algorithms 6-26-6-27 
cond) 13-200-13-201 bitwise-exclusive OR instruction (KOR) 13-249 
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bitwise-logical branch 
AND 13-62 addressing modes 2-17 
3-operand 13-63 conditionally 
with complement (ANDN) 13-67 delayed instruction (BcondD) 13-81 
complement instruction (NOT) 13-184 standard instruction (Bcond) 13-79 
OR instruction 13-188 conflicts 8-4 
delayed 7-9-7-10 
block execution 7-10 
diagram, TMS320C3x 1-3 incorrect use of 7-9 
repeat-mode incorrectly placed 7-7, 7-10 
control bits 7-3 incorrectly placed 7-6 
nested block repeats 7-8 unconditionally 
operation 7-3-7-4 delayed instruction (BRD) 13-84 
RC register value 7-7 standard instruction (BR) 13-83 
registers (RC, RE, RS) 7-2 bus 
restrictions 7-6—7-7 cycles 10-39 
RPTB instruction 7-4—7-5 IOSTRB 10-42 
RPTS instruction 7-5 STRBO 10-39 
size (BK) register 3-4 STRB1 10-39 
transfer completion 12-51 operation 
external 2-19 
block-repeat (RS, RE) registers 3-17 internal 2-18 
boot loader timing 10-39 
code description C-2 buses 
code listing C-4 data 2-18 
definition D-2 DMA 2-18 
flowchart C-3 program 2-18 
hardware interface, TMS320C32 11-23 busy-waiting loop, example 7-16 
interrupt and trap vector mapping 11-11 byte-wide configured memory, 
memory 11-19 TMS320C31/VC33 11-9 
precautions 11-13 
serial-port loading 11-11 
TMS320C31/VC33  11-2—11-13 
data stream 11-7 
description 11-2 Gacns i 
external memory loading 11-9 COnu Pe : 
memory load flowchart 11-5 Gabne Cea Bi(GC) see 
; cache enable bit (CE) 4-22 
mode selection ie cache freeze bit (CF) 4-22 
mode selection flowchart 11-3 hit 4-21 
sequence 11-4 instruction 2-16, 4-19 
serial port load flowchart 11-6 algorithm 4-21-4-22 
TMS320C32 11-14—11-24 memory 2-13 
data stream 11-20 architecture 4-19 
description 11-14 miss 4-21 
external memory interface 11-23 segment 4-21 
mode selection 11-14 subsegment 4-21 
mode selection flowchart 11-17 call, subroutine 
sequence 11-15 conditionally instruction (CALLcond) 7-11, 
serial port load flowchart 11-18 13-86 


Index-4 


instruction (CALL) 7-11, 13-85 
response timing 7-12 
carry bit, definition D-2 
carry flag 13-29 
central processing unit. See CPU 


circular addressing 6-21-6-25 
algorithm 6-23 
buffer 6-21-6-25 
definition D-2 
FIR filters 6-24 
Operation 6-23 


CLKX pins 12-22 


clock mode 
timer interrupt 12-13 
timer pulse generator 12-7-12-9 


clock periods, minor 8-24 


compare 
floating-point value instruction (CMPF) 13-88 
integer instruction (CMPI) 13-92 


comparison, feature set 2-27 
condition codes and flags 13-1, 13-30, 13-31 


condition flag 
floating-point underflow 13-29 
latched floating-point underflow 13-29 
latched overflow 13-29 
negative 13-29 
overflow 13-29 
zero 13-29 


conditional delayed branches 
compare instructions 7-9 
extended-precision registers 7-9 
context save/restore, definition D-2 
control register 12-51 
control registers, external interface 9-2 
expansion bus 9-9—9-15 
primary bus 9-7—9-8 
conversion 
floating-point to integer 5-41-5-42 
integer to floating-point 5-43 
counter register (timer) 12-3, 12-7 
CPU 
arbitration 12-63 
block diagram 2-7 
cycle, definition D-2 
definition D-2 
general 2-6 
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interrupt 
DMA interaction 7-40 
latency 7-35 

CPU (continued) 

processing, block diagram 7-34 
processing cycle 7-33 

primary registers 2-9 

register file 3-2 

registers 3-1—3-20 
auxiliary (AR7-ARO) 2-10 3-4 
block size (BK) 2-11, 3-4 
block-repeat (RS, RE) 3-17 
data-page pointer (DP) 2-10, 3-4 
extendea-precision (R7—RO) 2-10, 3-3 
VO flag (IOF) 2-11, 3-16 
index (IR1, IRO) 2-10, 3-4 
interrupt flag (IF) 2-11, 3-11, 7-32 

bits defined 3-13 

interrupt-enable (IE) 2-11, 3-9, 7-32 
list of 3-2 
program-counter (PC) 2-18, 3-18 
repeat end-address (RE) 2-11, 3-17, 7-2 
repeat start-address (RS) 2-11, 3-17, 7-2 
repeat-counter (RC) 2-11, 3-17, 7-2 
reserved bits and compatibility 3-19 
status (ST) 2-11, 3-5, 7-32, 13-29 
system-stack pointer (SP) 2-11, 3-4 

transfer, with serial-port transmit pol- 
ling 12-43-12-44 


DO-D31, definition D-2 
data, buses 2-18 
data formats 5-1—5-48 
floating-point 
addition and subtraction 5-32-5-36 
conversion to integer 5-41-5-42 
floating-point formats 5-4—5-13 
conversion between formats 5-12-5-15 
extendea-precision 5-8-5-9 
short 5-5—5-6 
single-precision 5-7—-5-8 
integer 5-2 
single-precision 5-2-5-3 
unsigned 5-3 
integer to floating-point conversion 5-43 
normalization using NORM instruc- 
tion 5-37-5-38 
rounding with RND instruction 5-39-5-40 
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unsigned-integer formats, single-preci- 
sion 5-3-5-5 
data memory, TMS320C32 2-20 
data-rate timing operation 
fixed 12-36 
burst mode 12-36 
continuous mode 12-36 
variable 12-39 
burst mode 12-35 
continuous mode 12-40 
data-page pointer (DP) 2-10, 3-4 
data-receive register (DRR) 12-28 
serial port 12-28-12-29 
data-transfer operation, handshake 11-20 
data-transmit register (DXR) 12-28, 12-32, 12-36, 
12-37 
data-address generation logic, definition D-2 
data-page pointer (DP), definition D-2 
DBR instruction 8-8 
decode phase, definition D-3 
decrement and branch 
conditionally 
delayed instruction (DBcondD) 13-97 
standard instruction (DBcond) 13-95 
instruction (DBR) 8-8 
delayed branch 7-9-7-10 
correct device operation 7-50 
example 8-6 
incorrectly placed 7-7 
dequeue (stacks) 6-29, 6-31 
destination-address register 12-51 
direct addressing 6-4—6-29 
direct memory access (DMA) 2-24 
disabled interrupts by branch 7-9 
displacements 
indirect addressing 6-5 
PC-relative addressing 6-19 
divide clock by 16 instruction (LOPOWER) 13-135 
DMA 
architecture 2-24 
block moves 12-48 
buses 2-18 
controller 2-18, 2-24, 12-48-12-68 
2-channel, TMS320C32 12-49 
address generation 12-57 
arbitration 12-63 
basic configuration 12-51 


Index-6 


basic operation 12-50 
block diagram 2-25 
channel synchronization 12-65-12-67 
DMA (continued) 
functional description 12-48 
global-control register 12-53 
internal priority schemes for ’C32 12-62 
interrupts 12-64 
priorities 12-62 
register 12-51 
transfer-counter register 12-58 
coprocessor, definition D-3 
destination register 12-67-12-68 
destination/source address regis- 
ter 12-57-12-59 
Initialization/reconfiguration 12-73 
interrupt, CPU interaction 7-40 
interrupt-enable register 12-59-12-62 
interrupts 7-38 
control bits 7-38 
processing, block diagram 7-39, 7-40 
memory transfer 12-67—12-68 
single DMA timing 12-68 
PRI and CPU/DMA arbitration rules for 
C32 «12-64 
registers, initialization 12-50 
setup and use examples 12-74—12-80 
source register 12-67—12-68 
start 12-50 
timing 
expansion bus destination 12-72 
on-chip destination 12-69 
primary bus destination 12-70 12-71 
transfer-counter register 12-58—12-59 
word transfers 12-50 
dual-access RAM, definition D-3 
DX pins 12-22 


event counters 12-2 
example instruction 13-38-13-40 
execute only 8-12 
parallel store followed by single read 8-14 
single store followed by two reads_ 8-13 
expansion bus 9-2 
control register 9-9-9-15 
bits described 9-9 
functional timing of operations 9-15 
VO cycles 9-21-9-36 


interface, signals 9-5 
programmable wait states 9-10-—9-11, 
10-15—10-16 
extended-precision 
(R7-RO) registers 3-3 
definition D-3 
floating-point format, definition D-3 
external 
buses (expansion, primary) 2-19 
interface 
control registers 9-2 
memory map 9-6 


timing, expansion bus I/O cycles 9-21-9-36 


timing, primary bus cycles 9-15-9-20 
interrupt 7-26, 7-36 
buses (expansion, primary) 2-21 
definition D-3 
interlocked-instruction signaling 2-21 
memory interface 9-1—9-38 
configurations 10-7 
control registers 10-7 
features 10-2 
overview 10-3 
timing 9-15—9-38 
reset signal 7-21 
external bus operation 
external interface control registers 9-2 
expansion bus 9-9-9-15 
primary bus 9-7-9-8 
external interface timing 
expansion-bus I/O cycles 9-21-—9-36 
primary-bus cycles 9-15—9-20 
external memory interface timing, expansion 
bus 9-15-9-38 
programmable 
bank switching 9-12-9-14 
wait states 9-10-9-11, 10-15—10-16 


FFT algorithms, using bit-reversed addressing to 
implement 6-26 

FIFO buffer, definition D-3 

FIR filters, implementation using circular addres- 
sing 6-24 

FlX instruction 5-41 
flowchart 5-42 

fixed data-rate timing operation 12-36 
burst-mode timing 12-36 


continuous-mode timing 12-36 
fixed priority, for ’C32 12-62 


flag 
carry 13-29 
condition 


floating-point underflow 13-29 
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latched floating-point underflow 13-29 


latched overflow 13-29 
negative 13-29 
overflow 13-29 
zero 13-29 
FLOAT instruction, flowchart 5-43 
floating-point 
addition, flowchart 5-33 
multiplication 
examples 5-29-5-31 
flowchart 5-28 
operation 5-1—5-48 
to integer conversion instruction (FIX) 
underflow condition flag 13-29 
values 
fractional 5-11 
negative 5-11 
positive 5-10 
floating-point format 5-4—5-13 


13-99 


2s-complement, converting IEEE format to 5-15 


addition and subtraction 5-32—5-36 

examples 5-34—5-36 
conversion 

between formats 5-12-5-13 

TMS320C3x to IEEE 5-22 

to IEEE standard 754, 5-14 

to integer 5-41-5-42 
converting integer to 5-43 
determining decimal equivalent 5-9 
extended-precision 5-8—5-9 

normalization 5-37-5-38 

rounding value 5-39-5-40 
multiplication 5-26 


short, for external 16-bit data, TMS320C32 5-6 


single-precision 5-7—-5-8 
frame sync 12-37, 12-38 
FSX pins 12-22 


general addressing modes 13-20-13-23 
global memory, sharing 7-17 
by multiple processors 7-13 
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global-control register 
DMA 12-53-12-59 
serial port 12-15, 12-17-12-21 
timer 12-3, 12-4-12-6 


handshake 11-20 

hardware interrupt, definition D-3 

hit, definition D-3 

hold cycles 9-37 

hold everything 8-15 
busy external port 8-16 
conditional calls andtraps 8-18 
multicycle data reads 8-17 


I/O flag (IOF) register 3-16 
bits defined 3-16 
CPU register file 3-16 
definition D-4 
I/O flags, external 2-21 
IACK instruction 7-35 
IACK signal, definition D-4 
idle until interrupt instruction (IDLE) 13-109 
IDLE2 
interrupt response timing 7-51 
power-down mode /7-49-—7-51 
timing 7-50 
IEEE format, converting floating-point format 
to 5-21 
IIOF flag register (IIF), definition D-2 
immediate addressing 6-18-6-29 
inactive bus states 10-51 
index registers 
(IR1,1IRO) 3-4 
definition D-4 
indirect addressing 6-5-6-29 
3-operand addressing mode 13-25 
ARAUs_ 6-5 
auxiliary register 6-5 
parallel addressing mode 13-26 
with postdisplacement 6-11 
with postinde 
6-15-6-18 
with predisplacement 6-9-6-11 
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indirect addressing (continued) 
with preinde 6-13-6-15 
instruction 
2-operand 13-3 
3-operand 13-4 
cache 4-19 
algorithm 4-21 
TMS320C32 2-16 
CALL 7-11 
CALLcond 7-11 
DBR 8-8 
FIX 5-41 
FLOAT 5-43 
IACK 7-35 
IDLE2 7-49 
interlocked operations 13-5—-13-6 
ISR 7-35 
LDFI 7-14 
LDII 7-14 
load and store 13-2 
low-power control operations 13-5 
LOWPOWER_ 7-51 
NOP 7-45 
NORM _ 5-37 
POP ST 7-41 
program control 13-4—13-5 
PUSH ST 7-41 
RETlcond 7-12, 7-48 
RETScond 7-11 
RND 5-39 
RPTB_ 7-2, 7-4 
RPTS 7-2, 7-5 
SIGI 7-14 
STFI 7-14 
STII 7-14 
TRAPcond 7-11 


instruction register (IR) 2-18 
instruction register (IR) 3-18 


instruction set 13-38—-13-172 
example instruction 13-38—-13-40 
summary, table 13-10 


integer format 5-2 
short integer 5-2 
sign-extended 5-2 
single-precision 5-2 
unsigned 5-3 

integer to floating-point conversion 
instruction (FLOAT) 13-103 
using the FLOAT instruction 5-43 


interface 
enhanced memory, TMS320C32 2-19 
expansion bus 2-19 
primary bus 2-19 


interlocked 

instructions 2-21 

operations 7-13-—7-20 
busy-waiting loop 7-15 
external flag pins (XFO, XF1) 7-13 
instructions 13-5—13-6 
instructions used in 7-13 
LDFI and LDIl instructions 7-14 
loads and stores 7-13 
multiprocessor counter manipulation 7-16 
STFI and STll instructions 7-14 


internal 
bus operation 2-18 
buses 2-8 
clock 12-10 
interrupt 7-26 
definitions D-4 
enable register, definition D-4 


interrupt 7-26—7-37 
acknowledge, instruction (IACK) 13-107 
acknowledge signal, definition D-4 
considerations 
TMS320C30 7-44—7-47 
TMS320C3x_ 7-41—7-43 
control bits 7-32 
interrupt enable register (IE) 7-32 
interrupt flag register (IF) 7-32 
status register (ST) 7-32 
CPU/DMA interaction 7-40 
definition D-4 
DMA 7-38, 12-64 
edge-triggered 12-64 
external 2-21, 7-36 
flag register (IF), behavior 7-32 
initialization 7-47 
latency (CPU) 7-35-7-36 
locations 3-15 
logic, functional diagram 7-37 
prioritization 7-31 
processing 7-33—7-35 
block diagram 7-34, 7-39 
serial port 12-34 
receive timer 12-34 
transmit timer 12-34 
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interrupt (continued) 
service routine (ISR) 7-35, 7-50 
instruction 7-35 
timer 12-2, 12-13 
vector table 
TMS320C30 and 
TMS320C31/VC33  7-26—7-28 
TMS320C32  7-29-7-30 
interrupt and trap 
branch instructions, TMS320C31/VC33, micro- 
computer mode 4-17 
vector locations, TMS320C32 4-18, 7-30 
interrupt service routine (ISR), definition D-4 
interrupt-enable (IE) register 
bits defined 3-10 
CPU register file 3-9 
interrupt-trap table pointer (ITTP) 3-14 
definition D-4 
interrupts, level-triggered 12-64 
IOSTRB 
bus cycles 10-42 
control register 10-9 
signal 9-3, 9-15 
ISR. See interrupt service routine (ISR) 


LAO-LA30, definition D-8 
latched 
floating-point underflow condition flag 13-29 
overflow condition flag 13-29 
LDO-LD31, definition D-8 
LDFl instruction 7-14 
LDll instruction 7-14 
load 
data-page pointer instruction (LDP) 13-134 
floating-point 
exponent instruction (LDE) 13-112 
mantissa instruction (LDM) 13-133 
value 
(LDF) 13-114 
conditionally instruction (LDFcond) 13-115 
interlocked instruction (LDFI) 13-117 
integer 
conditionally instruction (LDIcond) 13-125 
instruction (LDI) 13-123 
interlocked instruction (LDII) 13-127 
load and store instructions 13-2 
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logical shift instruction (LSH) 13-136 


LOPOWER_ 7-51-7-52 
timing 7-52 
low-power 
control instructions 13-5 
idle instruction (IDLE2) 13-110 


LRU cache update 4-19 
LSB, definition D-4 


mantissa, definition D-5 
maskable interrupt, definition D-5 
MAXSPEED, timing 7-52 


memory 4-2 
accesses 
2-operand instructions 8-25 
3-operand instructions 8-25 
data access 8-22 
data loads and stores 8-24 
internal clock 8-24 
pipeline 8-24 
program fetch 8-22, 8-24 
timing 8-24 
two data accesses 8-23 
addressing modes 2-17 
cache 2-13 
configured, TMS320C31/VC33, byte-wide 
conflicts 8-4, 8-9, 8-22 
execute only 8-12 
hold everything 8-15 
program fetch incomplete 8-11 
program wait 8-9 
data, TMS320C32 2-20 
DMA memory transfer 12-67—12-68 
enhanced interface, TMS320C32 2-19 
general organization 2-13 
global, sharing 7-17 
by multiple processors 7-13 
interface 
16-bit wide 10-26 
32-bit wide 10-20 
8-bit wide 10-32, 10-38 
control registers 9-7 
signals 9-3 
maps 
peripheral bus 
TMS320C30 4-9 
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TMS320C31/VC33 4-11 
memory (continued) 
TMS320C32 4-12 
TMS320C30 4-2, 4-4 
TMS320C31/VC33 4-6 
microcomputer mode 
TMS320C30 4-3 
TMS320C31/VC33 4-5 
TMS320C32 4-7 
microprocessor mode 
TMS320C30 4-2 
TMS320C31/VC33 4-5 
TMS320C32 4-7 
organization, block diagram 
TMS320C30 2-14 
TMS320C31/VC33 2-15 
TMS320C32 2-16 
parallel 
multiplies and adds_ 8-30 
stores 8-29 
pipeline conflicts 8-8 
program, TMS320C32 2-19 
timing 8-24 
TMS320C31/VC33, configured 
16-bit-wide 11-10 
32-bit-wide 11-10 
widths 
16-bit with 16-bit data type size 10-28 
16-bit with 32-bit data type size 10-30 
16-bit with 8-bit data type size 10-26 
32-bit with 16-bit data type size 10-22 
32-bit with 32-bit data type size 10-24 
32-bit with 8-bit data type size 10-20 
8-bit with 16-bit data type size 10-34 
8-bit with 32-bit data type size 10-35 
8-bit with 8-bit data type size 10-32 
memory-mapped register, definition D-5 
MFLOPS, definition D-5 
microcomputer mode, definition D-5 
microprocessor mode, definition D-5 
MIPS, definition D-5 
miss, definition D-5 
modes, boot loader 
flowchart 
TMS320C31/VC33 11-3 
TMS320C32 11-17 
mode selection 


TMS320C031/VC33_ 11-2 
TMS320C32 11-14 


MSB, definition D-5 
MSTRB signal 9-3, 9-15 


multiple processors, sharing global memory 7-13 
multiplication, floating-point, examples 5-29-5-31 


multiplier 
definition D-5 
floating-point/integer 2-8 
multiply 


floating-point value instruction (MPYF) 13-146 


integer instruction (MPYI) 13-159 


or CPU operation with a parallel store, instruction 


word format 8-29 


multiprocessor 
counter manipulation, example 7-16 
support, through interlocked operations 7-13 


negate 


floating-point value instruction (NEGF) 13-174 


integer instruction (NEGI) 13-178 
negative 
condition flag 13-29 


integer with borrow instruction (NEGB) 13-173 


nested block repeats 7-8 
no operation instruction (NOP) 13-181 
normalization, floating-point value 5-37—5-38 
normalize instruction (NORM) 13-182 
flowchart 5-38 
using 5-37—-5-38 


output value formats 13-28 


overflow 5-41 
condition flag 13-29 
flag (OV) bit, definition D-6 


parallel 
addressing modes 2-17, 13-25—13-26 
instruction set summary table 13-17-13-19 
multiplies and adds, instruction word for- 
mat 8-30 


parallel Instructions 


ADDI3 and STI 13-60—13-61 
FIX and STI 13-101—13-102 


parallel instructions 


ABSF and STF 13-42-13-43 

ABSI and STI 13-46—13-47 
ADDF3 and STF 13-55-13-56 
ASH3 and STI 13-76-13-78 
FLOAT and STF 13-105—13-106 
LDF andLDF 13-119-13-120 

LDF and STF 13-121 

LDIl and LDI 13-129-13-130 

LDl and STI 13-131-13-132 
LSH3 and STI 13-141-13-144 
MPYF3 and ADDF3_ 13-149-13-152 
MPYF3 and STF 13-153-13-154 
MPYF3 and SUBF3 13-155—13-158 
MPYI3 and ADDI3 _13-163-13-166 
MPYI3 and STI 13-167—13-168 
MPYI3 and SUBI3  13-169-13-172 
NEGF and STF 13-176-13-177 
NEGI and STI 13-179-13-180 
NOT and STI 13-186—13-187 
OR3 and STI 13-192—13-193 

STF and STF 13-217-13-218 
STland ST] 13-221-13-222 
SUBF3 and STF 13-232-13-233 
SUBI3 and ST! 13-237-13-238 
XOR3 and STI 13-252—13-254 


PC-relative addressing 6-19-6-20 
period register (timer) 12-3, 12-7 


peripheral 


bus 
definition D-6 
memory-mapped registers 
TMS320C30 4-10 
TMS320C31/VC33 4-11 
TMS320C32 4-13 
peripherals on 
DMA controller 12-48—12-68 
serial port 12-15-12-47 
timers 12-2 
register diagram 2-22 
serial ports 2-23 
timers 2-23 
modules, block diagram 2-22 
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peripherals 12-1—12-68 
DMA controller 12-48-12-68 
CPU/DMA interrupt enable regis- 
ter 12-59-12-62 
destination- and source-address regis- 
ters 12-57—-12-59 
global-control register 12-53-12-59 
Initialization/recontiguration 12-73 
memory transfer timing 12-67—12-68 
programming examples 12-74—12-80 
transfer-counter register 12-58-12-59 
general architecture 2-22 
serial ports 12-15—-12-47 
data-transmit register 12-28 
data-receive register 12-28—-12-29 
FSR/DR/CLKR port control regis- 
ter 12-23-12-24 
FSX/DX/CLKX port control regis- 
ter 12-22-12-23 
functional operation 12-35-12-41 
global-control register 12-17-12-21 
initialization/reconfiguration 12-41 
interrupt sources 12-34 
operation configurations 12-29-12-31 
receive/transmit timer control regis- 
ter 12-25-12-27 


receive/transmit timer counter register 12-27 
receive/transmit timer period register 12-28 


timing 12-31-12-34 
TMS320C3x interface exam- 
ples 12-41-12-48 
timers 12-2—12-14 
global-control register 12-4-12-6 
initialization/recontiguration 12-13-12-17 
interrupts 12-13 
operation modes 12-10-12-12 
period and counter registers 12-7 
pulse generation 12-7-12-9 


pin operation, states atreset 7-21 


pipeline 
conflicts 8-4 
branch 8-4 


memory 8-8, 8-9 
register 8-6 
resolving (memory) 8-22 
decode unit 8-2 
definition D-6 
execute unit 8-2 
fetch unit 8-2 
memory accesses 8-24 
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pipeline (continued) 
operation 7-42 
introduction 8-1 
read unit 8-2 
structure 8-2 
major units 8-2 


POP 
floating-point value instruction (POPF) 13-195 
integer instruction 13-194 


power-management modes _ 7-49-—7-52 
IDLE2 7-49-7-51 
primary bus 9-2 
bus cycles 9-15—9-20 
control register 9-7—9-8 
bits described 9-7 
BNKCMP and bank size 9-12, 10-17 
full speed accesses 9-15 
functional timing of operations 9-15 
interface, signals 9-4 
programmable 
bank switching 9-13, 10-18 
wait states 9-10-9-11, 10-15—10-16 


program 
buses 2-18 
control, instructions 13-4—-13-5 
counter, definition D-6 
fetch 
incomplete 8-11 
multicycle program memory fetches 8-12 
flow control 7-1—7-52 
calls, traps, and returns 7-11—7-12 
delayed branches 7-9-—7-10 
interlocked operations 7-13—7-20 
interrupt vector table, 
TMS320C32 7-29-7-30 
interrupts 7-26—7-37 
control bits 7-32 
CPU interrupt latency 7-35—7-36 
CPU/DMA interaction 7-40 
prioritization 7-31 
processing 7-33-7-35 
TMS320C30 considerations 7-44—7-47 
TMS320C3x considerations 7-41—7-43 
vector table 7-26—7-28 
power-management mode 7-49-—7-52 
repeat modes /7-2-7-8 
nested block repeats 7-8—7-15 
RC register value after repeat mode 7-7 
repeat-mode control bits 7-3 
repeat-mode operation 7-3—7-4 
restrictions 7-6—7-7 


program (continued) 
RPTB instruction 7-4—7-5 
RPTS instruction 7-5—7-6 
reset operation 7-21-7-25 
TMS320LC31/VC33 power management 
mode 
IDLE2 7-49-7-51 
LOPOWER_ 7-51-—7-52 
memory 2-19 
wait 
due to multicycle access 8-11 
until CPU data access completes 8-10 
program-counter (PC) register 2-18, 3-18 
programmable 
bank switching 9-12—9-14 
wait states 9-10—9-11, 10-15-10-16 
pulse mode 


timer interrupt 12-13 
timer pulse generator 12-7-12-9 


PUSH 


floating-point value instruction (PUSHF) 13-197 


integer instruction 13-196 


queue (stacks) 6-29, 6-31 


RAM. See memory 


RC register value, after repeat mode com- 
pletes 7-7 


read/write (R/W) pin, definition D-6 
receive shift register (RSR) 12-28 


receive/transmit timer 
control register (serial port) 12-25-12-27 
counter register (serial port) 12-27 
period register (serial port) 12-28 
register 
addressing 6-3-6-29 
conflicts 8-4 
file 
CPU 2-9 
definition D-6 
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registers 
buses 2-18 
CPU 2-9 


auxiliary (AR7-ARO) 2-10, 3-4 
block size (BK) 2-11, 3-4 
block-repeat (RS, RE) 3-17 
data-page pointer (DP) 2-10, 3-4 
extendea-precision (R7—RO) 2-10, 3-3 
condition flags 13-39 
extendea-precision registers (R7—RO) 7-9 
VO flag (IOF) 2-11, 3-16 
index (IR1, IRO) 2-10, 3-4 
interrupt flag (IF) 2-11, 3-11 
asynchronous accesses 7-45 
interrupt-trap table pointer (ITTP) bit 3-14 
interrupt-enable (IE) 2-11, 3-9, 12-59-12-62 
repeat end-address (RE) 7-2 
repeat start-address (RS) 7-2 
repeat-counter (RC) 2-11, 3-17, 7-2 
status (ST) 2-11, 3-5, 13-29 
system-stack pointer (SP) 2-11, 3-4, 6-29 
DMA 
destination and source address 12-57—12-59 
global-control register 12-53-12-59 
transfer-counter register 12-58-12-59 
DMA channel control 7-38 
instruction (IR) 2-12, 3-18 
interrupt-enable (IE) 7-38 
memory map, external memory interface 10-7 
IOSTRB control 10-9 
STRBO 10-8 
STRB1 control 10-8 
peripherals 
receive/transmit timer control 12-25 
serial port 12-15-12-47 
FSR/DR/CLKR- 12-23 
FSX/DX/CLKX 12-22 
global-control 12-17-12-21 
timer 12-3 
counter 12-7 
global-control 12-4 
period 12-7 
pipeline conflicts 8-6 
program-counter (PC) 2-12, 2-18, 3-18 
repeat mode operation 7-3—7-4 
reserved bits and compatibility 3-19 


repeat 


block instruction (RPTB) 13-209 
See also RPTB instruction 

single instruction (RPTS) 13-211 
See also RPTS instruction 
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repeat end-address (RE) register 3-17, 7-2 
repeat mode, definition D-6 


repeat modes 7-2-7-8 
control algorithm 7-4 
control bits 7-3 
maximum number of repeats 7-3 
nested block repeats 7-8 
operation 7-3-7-4 
RC register value 7-7 
restrictions 7-6—7-7 
RPTB instruction 7-4—7-5 
RPTS instruction 7-5 
repeat start-address (RS) register 3-17, 7-2 
repeat-counter (RC) register 3-17, 7-2 
definition D-6 
reset 4-14 
definition D-6 
operation 7-21—7-25 
performed 7-25 
pinstates 7-21 


reset and interrupt vector priorities 7-31 
reset pin, definition D-6 


reset, interrupt, and trap vector, locations, micropro- 
cessor mode, TMS320C31/VC33 4-16 


reset/interrupt,/trap vector, locations, microproces- 
sor mode, TMS320C30 and 
TMS320C31/VC33 7-27 


reset/interrupt/trap vector 
locations 
microcomputer boot mode, 
TMS320C31/VC33 7-28 
microprocessor mode, TMS320C30 4-15 
map 
microcomputer mode 4-14 
microcomputer/boot-loader mode 4-14 
microprocessor and microcomputer/boot-load- 
ermode 4-14 
microprocessor mode 4-14 


restore clock to regular speed instruction 
(MAXSPEED) 13-145 


RETIcond instruction 7-48 


return 
from interrupt conditionally instruction (RETI- 
cond) 7-12, 13-198 
from subroutine 7-11 
from subroutine conditionally (RETS- 
cond) 13-200 
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from subroutine conditionally instruction (RETS- 
cond) 7-11 
RND instruction 5-39 
flowchart 5-40 
ROM. See memory 
rotate 
left instruction (ROL) 13-204 
left through carry (ROLC) 13-205 
right instruction (ROR) 13-207 
right through carry instruction (RORC) 13-208 
rotating priority, for’C32 12-63 
round floating-point value instruction 
(RND) 13-202 
rounding of floating-point value 5-39-5-40 
RPTB instruction 7-4—7-5 
nesting 7-8 
pipeline conflict in 7-7 
to flush pipeline 8-5 
RPTS instruction 7-5—7-6 
to flush pipeline 8-5 


segment start address (SSA) 4-19 
semaphores, using in critical sections 7-17 
serial port 12-15-12-47 
block diagram 12-16 
clock 12-15, 12-31 
configurations 12-29-12-31 
timer 12-42 
timing 12-31-12-34 
continuous transmit and receive mode _ 12-33 
CPU transfer with transmit polling 12-43-12-44 
data-receive register 12-28—12-29 
data-transmit register 12-28 
fixed data-rate timing 12-36 
burst mode 12-36 
continuous mode 12-36 
frame sync 12-37, 12-38 
functional operation 12-35-12-41 
global-control register 12-15, 12-17-12-21 
handshake mode 12-19, 12-33-12-35, 12-42, 
12-43 
direct connect 12-34 
initialization reconfiguration 12-41—-12-47 
interface examples 
handshake mode example 12-42-12-43 
serial A/C interface example 12-45 
serial A/D and DIA interface exam- 
ple 12-46—12-48 


interrupt sources 12-34 


serial port (continued) 
loading 11-11 
memory mapped locations for 12-17 
operation configurations 12-29-12-31 
port control register 
FSR/DR/CLKR 12-23-12-24 
FSX/DX/CLKX  12-22-12-23 
receive/transmit timer 
control register 12-25-12-27 
counter register 12-27 
period register 12-28 
registers 12-15, 12-47 
timing 12-31-12-34 
short 
floating-point format, definition D-7 
integer format 5-2 
definition D-7 
unsigned integer format, definition D-7 
SIGI instruction 7-14 
timing diagram for 7-15 
signal, interlocked instruction (SIGI) 13-213 
sign-extend, definition D-7 
single-precision 
floating-point format, definition D-7 
integer format 5-2 
definition D-7 
unsigned integer format, definition D-7 
software interrupt 
definition D-7 
instruction (SWI) 13-242 
source-address register 12-51 


stack 6-30-6-31 
building 6-30 
definition D-7 
implementation of high-to-low memory 6-30 
implementation of low-to-high memory 6-31 
management 6-29-6-32 
pointer 6-29 
standard branch 7-9 
example 8-5 
status (ST) register 3-5, 13-29 
CPU register file 3-5 
definition D-7 
global interrupt enable (GIE) bit 
‘C30 interrupt considerations 7-44 
‘C3x interrupt considerations 7-41 


STFl instruction 7-14 
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STll instruction 7-14 
store 
floating-point value 
(STF) 13-214 
interlocked (STFI) 13-215 
integer 
instruction (STI) 13-219 
interlocked (STII) 13-220 
STRB signal 9-3, 9-15 
STRBO control register 10-8 
STRB1 control register 10-8 
subtract 


floating-point value instruction (SUBF) 13-228 


integer 
(SUBI) 13-234 
conditionally instruction (SUBC) 13-226 
with borrow instruction (SUBB) 13-223 
reverse 
floating-point value instruction 
(SUBRF) 13-240 
integer 
(SUBRI) 13-241 
with borrow instruction (SUBRB) 13-239 
synchronization, DMA channels 12-65 
synchronize two processors, example 7-19 
system management 6-29-6-32 
system-stack pointer (SP) register 3-4, 6-29 


test bit fields instruction (TSTB) 13-245 
timer 2-23, 12-2-12-14 
block diagram 12-2 
control register 12-13 
receive/transmit 12-25-12-27 
counter 12-2 
counter register 12-3, 12-7 
receive/transmit 12-27 
definition D-7 
global-control register 12-3, 12-4-12-6 
I/O port configurations 12-10 
initialization/reconfiguration 12-13-12-17 
interrupts 12-13 
operation modes 12-10-12-12 
output generation examples 12-9 
period register 12-3, 12-7 
receive/transmit 12-28 
pulse generation 12-7—12-9 
registers 12-47 
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timing figure 12-8 
timer-period register, definition D-7 
timing 
external interface 
expansion bus I/O cycles 9-21-9-36 
primary bus cycles 9-15-9-20 
external memory interface 9-15—9-38 
TMS320C30 
architecture, block diagram 2-3 
DMA controller 12-49 
arbitration 12-63 
external memory interface 9-1—9-38 
interrupt vector table 7-26 
memory maps 4-4 
memory organization, block diagram 2-14 
serial ports 12-15 
timers 12-2 


TMS320C31/VC33 
architecture, block diagram 2-4 
bootloader 11-2 
DMA controller 12-49 
arbitration 12-63 
external memory interface 9-1—9-38 
interrupt and trap memory maps. 11-12 
interrupt vector table 7-26 
memory maps 4-5, 4-6 
memory organization, block diagram 2-15 
serial ports 12-15 
timers 12-2 


TMS320C32 

architecture, block diagram 2-5 
bootloader 11-14 
datamemory 2-20 
datatypes and sizes 2-20 
DMA controller 12-49 

arbitration 12-63 
external memory interface 2-19, 10-1-10-52 
interrupt vector table 7-29 
memory, external widths 2-20 
memory organization, block diagram 2-16 
program memory 2-19 
serial ports 12-15 
short floating-point format 5-4, 5-6 
timers 12-2 
trap vector locations 7-30 

TMS320C3x 

device differences 2-27 
devices 1-2 

compared 1-5 
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DSPs, introduction 1-1 
functional block diagram 1-3 
TMS320C3x (continued) 
key specifications 1-3 
serial port interface examples 12-41-12-48 
TMS320LC31/VC33, power management mode, 
LOPOWER_ 7-51-—7-52 
transfer-counter register 12-51 
trap 7-11-7-12 
conditionally instruction (TRAPcond) 7-11, 
13-243 
flow, block diagram 7-47 
initialization 7-47 
interrupt considerations, C30 7-44—7-46 
operation 7-47 
vector locations 3-15 
traps 4-14 
two parallel stores, instruction word format 8-29 


unsigned-integer format 5-3 
single-precision 5-3 


variable data-rate timing operation 12-39 
burst mode 12-35 
continuous mode 12-40 


wait state 

definition D-8 

generation 9-11 

programmable 9-10—9-11, 10-15—10-16 
wait-state generator, definition D-8 


XFO, XF1 signals 2-21 


zero condition flag 13-29 
zero fill, definition D-8 
zero wait-state 9-15 
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zero-logic interconnect of devices 7-18 
zero-overhead looping 7-2 


Index-17 


